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1. INTRODUCTION 


A number of papers have been written in recent years on various examples 
and applications of the formula 


i) 40) + Ds = [ “fd +23 l ” 0s Bantf(t) dt. 


This formula is usually known as Poisson’s summation formula. Other forms 
of the formula are 


(1.2) ys f(n) = } . cos 2rnif(t) dt, 
and 
13) x smy= > f erpae. 


Further, if we write 
(1.4) g(x) = 2 , cos 2rxtf(t) dt, 
then, with suitable conditions, 
(15) fla) = 2 [ ” eos Qextg(t) dt 
by Fourier’s integral theorem, and (1.1) can be written 
rt) af(0) + 2 Hn) = 49(0) + Y on. 
Several writers have pointed out that it is often difficult to justify the appli- 
tation of the formula even in cases where the formal result can be shown by 


other methods to be true.! Most methods which have been used for proving 


sich formulae are elaborations of one of the two following types of formal 
procedure. 





'Erdelyi (1), 408, Kober (1), 612. 
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592 A. P. GUINAND 


(A). We may regard (1.2) as a Fourier series.” If we expand the periodic 
function 


DL fin+z2) 
as a Fourier series for —3 < x < 3, and then put z = 0 we formally obtain (1.2), 
(B). We may regard (1.3) as a result in the theory of analytic functions,’ [f 
f(z) is analytic, and we integrate 7 cot rzf(z) around the rectangle R with vertices 
N+4+ ai, —N — 3} + ai, then by Cauchy’s theorem 


— [ x cot nef(z) dz = X sn). 


r= 


Putting 
x cot rz = ri + Qri Do ™® 
n=1 
when 
I(z) < 0, 
and 
x cot rz = — at — 2m y R ° oes 
n=l 
when 
I(z) >0, 


and integrating term by term, we formally deduce (1.3) on making WN tend to 
infinity and a to zero. 

In addition to these methods two further methods have recently been de- 
veloped which admit of application to functions having various types of singu- 
larities. 

(C). Ferrar* derives a generalization of (1.1) by the use of Mellin transforms 
and the functional equation of the Riemann zeta-function. Conditions are 
applied to the Mellin transform of f(x) rather than to f(z) itself. 

(D). In a recent paper’ I gave another method for application to more general 
summation formulae of which (1.1) is a particular case. In this method we 
regard (1.1) as a Parseval theorem in the theory of Hankel transforms of func- 
tions of the Lebesgue class L’(0, ~ ), and prove: 

If f(x) is an integral, and both f(x) and xf’(x) belong to L?(0, ©), then 


(1.7) lim { > fr) - [ 10 a} = jim 1z g(n) — [ g(t) a, 


N-o (n= 





* Courant and Hilbert (1), 64-65. 

’ Kronecker (1), Landsberg (1). 

4 Ferrar (1). 

5 Guinand (1). Put a, = 1 in Theorem 2. 
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where g(x) satisfies 





_ 1 /f*sin 2rzy 
[ ow ay = - | 3 f(y) dy 


and is chosen so that it ts the integral of its derivative. 

The formula (1.7) obviously agrees with (1.6) if the series and integrals con- 
cerned converge. However, this is not necessary for the truth of (1.7), and it 
covers a number of examples in which (1.6) is inapplicable. 

In the present paper we simplify the above result slightly, and extend it to 
functions of the Lebesgue classes L’(0, ©), (1 < p< 2). Wealsogive examples 
to show the ease with which the result can be applied, and discuss extensions 
to sums involving primitive characters. 


2. PRELIMINARY LEMMAS 
Lemma 1.° If f(x) belongs to L?(0, ~) for some p satisfying 1 < p S 2, and 
py =p/(p—1), Riv) > —1, 
then there exists a function g(x) belonging to L? (0, ~) defined almost everywhere by 


(2.1) g(x) = Li.m. Qn I d (xt)! J,(Qmat)f(t) dt. 
Further 
(2.2) f(x) = lim. 2x i (xt)* J,(2nat)g(t) dt, 


and if fi(x), gi(x) and fe(x), ge(x) are two pairs of such transforms for the same 
values of p and v, then 


(2.3) ¢ filx)go(x) dx = I 4 fo(x)gi(x) dz. 


Lemma 2. If f(x) is an integral and tends to zero as x tends to infinity, and xf’ (x) 
belongs to L?(0, ©), (p > 1) then f(x) belongs to L?(0, ©), and x"f(x) tends to 
zero as x tends to +0 or infinity. 

Hardy’ has proved that, if ¥(x) belongs to L?(0, ~), so does 


ea) = [ vo, 
and 


t9(x) ~0asz > +0orz— ~. 


Putting 
v(x) = 2f'(z) 


the result: follows immediately. 


* Busbridge (1), or Kober (2). We only require the result for » = 3. 
' Titchmarsh (1), 396. 
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Lema 3.° The function x '{[x] — x} ds self-reciprocal with respect to the kernel 
Qrx'J;(2xx). That is 


Qn & f(t] — t} 2tt? Jy (Quat) dt = x {[z] — 2}. 
0 


Lemma 4. If f(x) satisfies the conditions of Lemma 2 then it has a transform g(x) 
with respect to the kernel 2 cos 2xx given for x > 0 by 


(2.4) g(x) = 2 | cos Qratf(t) dt. 
0 
Further 


f(z) =2 | % cos 2ratg(t) dt, 
0 


g(x) is an integral, both g(x) and xg'(x) belong to L?’(0, ©), «’g(x) tends to zero 
as x tends to +0 or ~, and 2f'(x), xg’(x) are a pair of transforms with respect to 
the kernel 2na*J 3(2rx) in the sense of Lemma 1. 

Put 


3 
Gi(x) = lim. 2 | cos 2ratf(t) dt 


To 0 


and 


Go(x) = Lim. 27 [ d (at)* J3(Qrat)tf’ (t) dt. 
0 


T—2 


By Lemmas | and 2 it follows that these expressions exist almost everywhere, 
and G(x), G2(x) both belong to L?’(0, ~). Now, if x > 0 


| ” | Glu) | ~ { [ | Ga(u) |? au) { [ e. iu} 
P ir Ole)” au)” a 


Hence the integral converges absolutely, and 





/ 


i * Ga(u) = <i | , = — " (ut)! Ty(Qeut)ef’() dl 
z z 0 


T—20 


T—2 


0 a 
~ teite | i J; (Qrut)f’"() dt 
z 0 


T C) 
=2rlim |] é’s’@ at [ u*J;(Qrut) du 


T—2 0 





8 Titchmarsh (2), 267. We also use the notation 


—co af 
= lim : 
a T—0 


a 
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1 T 
— lim sin 2ratf’(t) dt 


TIL Tx JO 


me lim {si Qrat f(t)]> — Irx . cos 2rat f(t) at 
TXL T-0 0 


=-—2 | cos 2rat f(t) dt. 
0 


The inversion of order of integration is justified by absolute convergence, and 
the integrated terms vanish by Lemma 2. Hence the limit (2.4) exists, and we 
may put 


g(x) = — [ G2(u) * = 2 . cos 2rat f(t) dt 


for all positive z. Hence g(x) is an integral and G2(r) = 2g’(x) almost every- 
where. I.e. af’(x), 2g’(x) are transforms of the required type, and the rest of 
the lemma follows from Lemma 2. 


3. THE SUMMATION FoRMULA 


We can now prove: 
THEOREM 1. If f(x) is an integral, tends to zero at infinity, and xf'(x) belongs to 
10, ~), (1 < p S 2) then 


(3.1) lim { Fil [ ” 40) a\ = te {z gin) — | gli) a} 
1 0 No 0 


No (n= n=1 


where 


g(x) = 2 [ # cos 2x f(t) dt. 
0 


Now x '{[z] — x} belongs to L?(0, ©) for any p > 1, so by Lemmas 3 and 4 
and (2.3) 


f; a {[x] — x} af'(x) dx = [ x '{[z] — x} af’(x) dz. 


The left-hand side is 


[ {{z] — x} f(x) dx = ({[z] — x} f(x)]lr — [ f(x) d{{z] — 2x} 


= — lim [ s@ atte — x} 


_—"* » jin) — [10 at. 


No \n=1 


Treating the right-hand side in the same way, we have (3.1). 
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However, Theorem 1 does not cover functions having discontinuities, for f( 2), 
being an integral, is necessarily continuous. In order to obtain a sufficiently 
general theorem covering such functions we combine Theorem 1 with a known 
form. of Poisson’s formula. 

THEorEM 2. If f(x) is of bounded variation in (0, ©), and tends to zero at in- 
finity, then 


tim [ Batsin +0) + yen — 0} — [90 at] = Xm - [ot 
where 


g(x) = 2 / we cos 2rzt f(t) dt 


exists as a Cauchy integral at zero and infinity. 
Titchmarsh’ has shown that with these conditions 


tim [Eat +0 +400} - [$0 a] = ¥ om) ~ 49040, 


Obviously 
N+} 


lim f) dt = 0, 


No YN 


and we can show that” 
0+0 =f gt a, 


Theorem 2 follows immediately. 

Now it sometimes happens that a function f(z) satisfies the conditions of 
Theorem 1 for one set of values of p in the neighborhood of the origin and for 
another set of values at infinity. In order to cover all these possibilities we will 
take as our general theorem: 

TuHroreM 3. If f(x) can be expressed as the sum of three functions f,(z), fal z), 
fs(x) such that all three tend to zero at infinity, f,(x) and fo(x) are integrals, afi(z) 
belongs to L”*(0, ©) for some p; in (1 < p, S 2), afo(ax) belongs to L”*(0, ©) for 


some p2in (1 < pe S 2), and f(x) is of bounded variation in (0, ~), then 


tim | Hm +0) +4 — 0} — [40 av| 
= lim | > g(n) — [. g(t) al, 


(3.2) 





® Titchmarsh (2), Theorem 45. 
1° Titchmarsh (2), Theorem 6. The required extension to the case z = 0 is trivial. 
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where 
g(x) = 2 | cos 2rat f(t) dt. 
—0 


(3.2) holds for fi(x), fo(x) and f3(xz) separately, and Theorem 3 follows by 
addition. 


4. EXAMPLES 


Many examples of Poisson’s summation formula have been noted by various 
writers. Most of these examples can be justified by Theorem 3 without diffi- 
culty. The examples given below are chosen to show the advantage of the use 
of Poisson’s formula in the form (1.7) rather than in any of the forms (1.1), 
(1.2), or (1.3). 

(i). Put 


f(x) =x“ (#<s< 1). 
The conditions of Theorem 3 are satisfied if we put 


f(x) = fi) + fala), 


pple OEP 
y= 
é* (221), 
0 (x < 1) 
(x) = ; 
fs a°—é¢* (x = 1), 
l1<n<s', m=2. 
Hence 
g(x) = 2 [ t* cos 2rat dt = 2(2r)** sin 4sxT(1 — 8s) 2°", 
0 
and 





lim 1 ane ats ~} = 2(2r)*? sin dsrT (1 _ 8) lim b> n* Px. v} 


No (n=l 1 No (n=l 8 
The expressions in brackets give the analytic continuation of ¢(s), and we have 
¢(s) = 2(2r)*” sin 4saI'(1 — s)¢(1 — 8), 


the functional equation of the Riemann zeta-function.” 
(ii). Consider 


(2) on gt etl? p p(n)e**"* (z > 0), 


n=0 





* Cf. Erdelyi (1), Kober (1), Bochner (1), 37-38, and Titchmarsh (2), 64-65. 
’ Various methods of deriving this result from Poisson’s formula have been given by 
other writers, but none is as direct as the above. Cf. Titchmarsh (2), 65. 
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where p(n) is the number of unrestricted partitions of n. Using the generating 
function” of p(n), we have 


C) 
-} /12 —2 —1 
(2) =z e”” I} (1 —_ —_ ‘ 
n= 
and 


log g(z) = —tlogz+ 5 <= } log (1 — etry, 


n=1 
Now put 
f(z) = —log (1 — &**™™) 


in Theorem 1. The conditions are satisfied with p = 2, and 


g(x) = —2 3 log (1 — e°*"*) cos 2rat dt 


i [fe —2amtt oos Qrat dt 
22, m 


= 2S ie ta. 
T m=1 


The term by term integration is justified by absolute convergence. If we apply 
Theorem 1 to this sum, we find that for z > 0 


sae g aamels a x 
m=. =e + 25 Onn 
a 1 eerie = —1 » Ls 
™: _- I + * Qrx2" 
Further 
zea l T 
90) = = man” ee’ 
Hence 
= 7 ] = —2enz) 
p> 0g ( . ) 12z 
1 - 1 Zz 
_ li f 2rn/z _ 1 ee 
Peni Pit te aia 2n sé} 


a Qrt/z -1 1 es an 
I ee te -2" = 2t saat | 
= : 1 (els be 1)? 4 Ly NZ 


n=1 1 ~ 72 


N-©o 





13 Hardy and Wright (1), 272. 
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Now 
. V (germis _ 1)" = > Py 2xmn/z 
n=1 1 n=1 71 m=1 
Cy ) 1 - , 
_ = xmn/z 
-Z n° 


oo 


=— ) log (1 — e **”"*), 
m=1 


and the above limit is equal to 
[ L (prtie _ +5 - esa | , (err pt — ha 
b \t 2nt 2rt? 
+ - lim Lf’ a — og. | 
N-o2 
1 
- fey 4h 
a [ ti , aiid 2u aap du 
ru se 1 
+f it (e a yt = gh au — 


= K — 3} logz, 


where K is a constant. Hence 


—)) log (1 — eo) — a = -> log (1 — e ?™"*) — si + iy —K + § logz. 
n=1 m=1 
That is 


1 
log y(z) = log (2) + ty — &. 


Putting z = 1 it follows that K = 4y, and 


g(z) = (2). 


rc) c) 
gt emli2 > p(n)e**"* = zien mt p(n)e?*""*, 
n=0 


n=0 


Hence 


This is a well-known result in the theories of partitions and elliptic modular 
functions. 


(iii). Put’ 
fi) = @+2)7 








“Cf. Watson (1). 
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then the conditions of Theorem 1 are satisfied with p = 2, and 
g(x) = 2 (2 + 2) cos 2natdt = 2Ko(2Qrzz). 
0 
Hence 
C2) 1 N N 
2 >> Ko(Qenz) — — = lim | (2 +n’)? - | (2+)? av| 
awl 2z N>olLn=1 0 
N P si 1 f N 1 
im | 4 +n’)? - 1h 4 a — Jog N} 
No Ln=1 n \Sin 
N/z 
+ {log N “ (1+ ua] 
0 
“(eee ter salta (eal 
-X\¢ +n’) are? lim log —tlotp 


= a G +n)? — \ + y + log iz. 


5. Serres INVOLVING PRIMITIVE CHARACTERS 


Poisson’s summation formula can be generalized by the introduction of the 
primitive characters as coefficients.” We require the following elementary prop- 
erties of characters: 

(i). If x(n) ts a character modulo k (k > 1) then 


x(m)x(n) = x(mn), 
x(m + rk) = x(m) 


where k, m, n, and r are integers. Further x(n) vanishes if n is not prime to k. 
(ii). If x(n) ts a primitive character modulo k (k > 1) then 


(5.1) > x(ner""* = e(x)kx(m), 
where 
(5.2) ex) =k? Y xine 


and X(m) is the conjugate of x(m). 
Now x(—1) = +1, since 


{x(-1)}* = x(1) = 1. 


16 Some particular cases of this type of generalization have been given by Ramanujan. 
Cf. Titchmarsh (2), 82. 

16 Landau (1), chapters XXII and XXX. The notation for e(x) used here differs from 
Landau’s by a factor —7 in the case x(—1) = —1. 
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Suppose that x(—1) = 1; then 


2 
p> x(n) sin — = 0, 


since 
x(k — r) = x(—r) = x(—1)x(r) = x(r), 


and consequently the terms n = randn = k — recancel. Also the term n = 3k, 
if it occurs, is zero since x(3k) = 0. Hence (5.1) becomes 


k 


63) > x(n) cos 7 = (xk x(n), 


n= 





Now if we put 


(-y<a<y), 





f(x) = k 
0 (|x| > y) 


n (1.2), we find that 


~ pie 2amn _ k< 1 


in oY 
- —— (nk 
—ysnsy Tv n=o nk + m +m k (n + m), 
where the prime indicates that the terms n = -+y are to be halved if y is an 


integer. If we multiply this result by x(m), and sum from m = 1 tom = k, 
we have 


2 x(m) ~’ cos? = ED xm ) > - 


—vsnsy n=—0 





7 sin ay (nk + m). 


Go! FE’ xm) =F SE HF E+ ain 2° (nk + m) 


—ysns<y T ml naceo nk+m~ 


Hence 


dH DO’ x(n) = * 3X sin PaMY 
lgnsv Tna=l1 1 


Now let us consider the integral 


(5.4) ark [ ot 2’ x@} (2) 4 | 72) dt. 
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We have 
na! "| _~ , (n)} t ~ I ini di 
k lsnx<t k 


= ES bx) + x@) +o + xem) [1 (7) a 








} w-1 
: > {x(1) + x(2) + eee + +x}! sin = - ~ 5 “~ bi 4 | 


4 oN . 
_¥ > x(n) oan _ Kk ie 2nNx > x(n). 


TL Axt on k= «Na 
If we make N tend to infinity the integral (5.4) becomes 
é< x(n). 2nnx _ 
ee a le 2 X(n)}. 


That is the functions 


a*{ de’ x(n}, e(x)a™ a D’ x(n)} 


l<n<z snc 


are transforms with respect to the kernel 


Qra’  (Qrx 
. i (7). 
If we now apply the methods of section 3 we can obtain a result analogous to 
Theorem 1. We can also obtain a result corresponding to the case x(—1) = —1. 


We have: 
THEOREM 4. If x(n) is a primitive character modulo k (k > 1), f(x) is an integral, 
tends to zero at infinity, xf'(x) belongs to L?(0, ©), (1 < p S 2), and either 


(i) x(-1) = 

g(e) = 2k | cos 28 f(a, 
or (ii) x(-1) = -1, 

g(e) = —2ielx)k* [sin 22 (0 a, 
then 


b> x(n)f(n) = } x(n)g(n). 


B.i or real primitive characters x(n) = x(n), and it follows on onesie (x i 
e **'/* = g(x) in case (i) that e(x) = 1. Similarly in case (ii) f(z) = 
g(x) gives —ze(x) = 1, and we have: 








then 


TR 
=a 


pe by 
—_— —_“- 
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Tuzorem 5. If, in addition to the assumptions of Theorem 4, x(n) is a real 
character, and either 


(i) x(-1) = 1, 

g(a) = 2k4 I — = SO dt, 
or (ii) x(-1) = —1, 

g(x) = 2K [ om — f(t) dt, 
then 


Y x(ndgln) = Ye xlng(n). 


PrincETON, NEW JERSEY. 
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ON THE REARRANGEMENT OF CONDITIONALLY CONVERGENT SERIES 


By C. T. RasJaGopau 
(Received June 25, 1940) 


This note is concerned with an application of the idea which finds expression 
in 

LitrLeEwoop’s THEOREM [2]. Jf F(x) is positive monotone decreasing and 
{D,} ts a sequence such that 


0=D<Di< De<.::- <D,z— ~, 
ni- oo, 
D, 7 D,-1 = O(1), 


then >.° (Dn — Dn-1)F(D») converges or diverges with | F(x) dz. 


In an earlier note [6], I have shown that the idea behind the theorem—that 
of a condensation effected on series by integrating a suitable function over in- 
tervals of the type (D,_1 , D,)—supplies the basis for a unified theory of con- 
vergence criteria for series of positive terms, more comprehensive than Prings- 
heim’s [3] in certain respects. In another note [7], I have shown that the idea 
suggests elegant proofs of certain classic theorems of Pringsheim [4]. I prove 
here that the same idea can be used to generalize the enunciations and simplify 
the proofs of Pringsheim’s theorems concerning the rearrangement of condi- 
tionally convergent series [5]. 


1. Notation 
S = So%-1(-1)""a,(a, > 0) is a conditionally convergent series, 1 = 
> -1 (—1)”a,. *_1(—1)""a, is a rearrangement of s in which 84. = 


> 7" (—1)’"a; contains the first p positive terms and the first q negative 
terms of s. 

1.1. The theorems which follow are typical of the results obtained by Prings- 
heim. It is supposed in their enunciations that a, = a(n), where a(z) is a dif- 
ferentiable function of x, and that limiting relations exist, as specified, between 
p and q. 

TuErorEM A. If, in S = )°f(—1)""a,, a, and na, tend steadily to 
infinity with n, then >>~ (—1)""a, has a sum s’ given by 

s'=s+ lim (p — q)dy. 


p>q-~ 





1 The numbers in [ ] denote the references at the end of this note. A slight generaliza- 
tion of Littlewood’s theorem, which dispenses with the condition: D, — Dn: = 0(1), ” 
given in de la Vallée-Poussin, Cours d’analyse, 1 (Louvain-Paris, 1926), 398. A generaliza- 
tion in a different direction is given in Rajagopal [8], §2. 

2 All the references to Pringsheim in the sequel are to [5]. 
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SSR St 


TueorEM B. Jf, in s, lim na, ts finite, equal to g (say), then 


no 


= s+ J lim (log 2p — log 29). 
2 », 


qo 


TuroreM B’. If, in s, lim n-ln-hn --- lz in-an = g (where ln = log n, 


no 


Ln = log log n, ---), then 


s' = s+ I lim (l,2p — 1,2q). 
2 p,q 

TueorEM C. If, in s, na, tends steadily to the limit zero and if (p — q)deq = 
0(1) (p > g— &), then 8’ = 8. 

Theorems A, B, C, B’ are suggested by (26), (30), (31), (34) respectively in 
Pringsheim’s paper. The restriction on a, in all the theorems is of the form: ' 
lim,+,.f()-@» is finite, where f(x) is a positive differentiable function for all 
large x. The distinction between the theorems arises from the fact that in 
Theorem A, lim f(n)/n = 0; in Theorem B, lim f(n)/n is finite and non-zero; in 
Theorems B’ and C, lim f(n)/n = «©. A question, suggested by Littlewood’s . 
theorem, is whether we cannot with advantage replace f(n) by f(D,,) in the con- | 
dition imposed on a,. An attempt at an answer is contained in Theorems 1 
and la which are found to cover all the cases discussed by Pringsheim. 

In all the theorems and the lemmas proved below it is tacitly assumed 
thatO0 < d, = D, — Dai < K (fixed). 


2 


TurorEM 1. Jf f(x) is a positive function with a continuous derivative f'(2) 


7 

such nat = o(1) (rx > a and if iim De Des g, then : (—1)""a, 
manne 2p 

oscillates between s + g lim 


P.q—eo Dag it ia) 


> eanepne nis : 





, ided is bounded. 
i bn HO 


PRoor: 


D, — D,_2 'y dx me | 1 As | 
= yy tN d = D,_ ). 
~jD.) eb. iw |" ' 
Integrating by parts the last term of this identity, we find 


D, — Dis _ Dy, dx Dy af | 
ey Le)...“ D2) | 3 





Po dx 
“ 7 — 2); 
Lies * OL, ie ' 
whence, given e > 0 arbitrarily small, we can find 7 such that 
Dy 
(1) — dx = M%). 
o- of je <e<otel tm | RM 


Pa nee 
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If for the sake of definiteness, we suppose p > gq, 


p+a 


2q 
(2) DY (-1)""a, = 2 (—1)" a, + (Geqp1 + Gegs + +++ + dep). 


v=1 


Hence, if 2g + 1 2 m, summing (1) for » = 2g + 1, 2¢ + 3, --- 2p - 1, 
we obtain from (2), 


Dep-1 
banectlrtas io’ 
(8) Sree + dx 
> S24 + (g — €) Dees IC)’ 





where we can write 
(4) ieee se AS ie ie ee * ds 
Dag-1 J(2) Deg J(2) Deg-1 J (2) Dep-1 J (2) 
and show that the second and third terms of the right hand member are 0(1) as 


Stig. 


First, when 0 < v — u = O(1) (u— &), 














foe) _ f['f@, _ poe 
108 Fa) "Lie dx = o(1) (u ). 
Or, 
(5) fr) ~ fu). 
Next, 
ee dx oe Don — Deni 1 
| ie 7 f{_ Dena +> O(Don a Don-1)} ia Sas 
Don Wel Don-1 > 0 
ee te = 
in virtue of (5). 
Also, 
Deon, Cd Doni Don hig Don_2 
0 = ~ 2), 
< TDn) < f(D») o(1) (n 
in consequence of the restriction ae = 0(1) (N > ~) which results 
_ an . 
from dy = o(1) and (Dy = Dna) ~ g; whence 
me ds ve — 0), 
i Sia 7 Y 


We can now complete the proof by using (6) in (4) and (4) in (3). 
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In certain applications it is useful to have the following complement to 


Theorem 1. 

TurorEM la. If, in Theorem 1, we replace the restriction relating to f'(x) by 
the assumption that f(x) tends steadily to ~ with x, then either 8y4¢ — Sq (p > 9), 
Sop (q > p), has an asymptotic representation which oscillates between 





or Sp4¢ ym 2 Pw de 
P dx : » : 
; as al —— 1s bounded. 
g im J Flay re Jo, Fay? oun 


Proor: Supposing (as before) p > gq, 





Dog1 — Dog-1 _, Dears — Dog+41 Bilan ne Dop-1 — Dey-s ™ ie dz + o(1) 





~ f(Drgs1) f( Deets) f(Dep-) Dee-1 J (2) 
(q— ©). 
Par-! de , 
As in (4), we can split / j@) into three terms of which the last two are o(1) 
Deq-1 


as Pp, q— ©, in virtue of the condition: f(x) — © with z. 
Hence, as g > ©, 








Deroy — Dag Dep» — D 
Arg41 + Ags + ++ + Api ~ af “(Das)” : feeet ead 


_ pt de 
=?) ie 


, , =oivt ) FOR idig 
which oscillates between g lim [ 


Deq f(z) 

Remark. The argument employed to prove Theorems 1 and 1a shows that, 
under the conditions of these theorems, the sum s’ of the rearranged series 
is given by 

Pw dz 
(7) =stglim/ 
Fe Song I) 
provided this limit exists and is finite. If the limit is infinite, then 
>t (-1)""a), diverges. 

2.1. We may consider, in the light of this remark, series of the type in 
Pringsheim’s paper, in forms slightly more general than his. The generalized 
series are all of the kind whose conditional convergence we can establish by 


Lemma 1. If,asz— ©, f(x) decreases steadily to the limit zero and | F(x) dx 


is divergent, then >>” (—1)" F(D,) is conditionally convergent. 

Proor: The convergence of the series in question follows from Leibnitz’s 
rule for alternating series. 

To prove that the convergence is not absolute, we observe that KF(D,) > 
(D, — D,1)F(Dz) which, by Littlewood’s theorem, is the general term of a 
divergent, series. 








; 
& | 
a 
f 


i, 
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Examples of the lemma are furnished by >>” (—1)"~a, , where a, may be 
any one of the functions: 


1 Ew log Dy, 1 1 A,B> 0; 
D* (0<y4 SZ 1), log D,’ Dn ’ D,-log D,’ AD, + BD* (; > u ey 


A second proposition from which we can infer the conditional convergence 
of a certain type of alternating series is 
Lemma 2. If u, > 0 ts such that 


Un/dy 1 _ 44 _ (de 
" ree eae rte 


then (i) n My Un 8 convergent or divergent according as » > 0, or < 1; (ii) 
y Sg (—1)""u,/d, is convergent for u > 0, (iii) the convergence in (ii) is condi- 
tional forO <u Sl. 

This result is at once an extension and an amplification of Gauss’s test. 





, ” ' , me: z, : 
(i) and (ii) are immediate consequences of the relation: ne cD,,", equivalent 


to (8)°. 

(iii) follows from the fact that un/dn > un/k which, by (i). is the general term 
of a divergent series when »p S 1. 

As an application of Lemma 2, consider the case: 


[[ui(ed, + D,)(8d, + D,) 
" [[P-ilvd, + D,)(6d, + D,) 
where a, 8, y, 6 are real and the two latter  —D,/d, (n = 1, 2,---). Itis 


clear that >> u, reduces to the hypergeometric type when d, = 1, 6=0. 
Further, 


n 





Un = 


Un/dn  _ og _. _ pan _ nf dn 
ia. eer Re 0(3). 


n 


The terms of >>” u, being ultimately of the same sign and the terms of 
>” (—1)""u,/d, alternately positive and negative, Lemma 2 shows that 


(9) ; —1)""q, = S ayn-1 []p-1(ad, + D,)(8d, + Dy) 
alia lihiteciil T[t-i(vd, + D,)(8d, — D,) 


ts conditionally convergent forO <y +6—a—£B SX 1. 
REMARK. 4a, in (9) can be related to a generalization of Gauss’s function: 





z 











T(z) = lim a 
saad +9(1 + 5) a0 (1 3) ee ae ee 
2 n 
= (1 + z) 
j | * Lemma 2 (i) is, in fact, a particular case of a theorem first proved by me in Bull. Amer. 


Math. Soc. 43 (1937), 411. But the proof of the theorem given there is not applicable to 
Lemma 2 (ii). 
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“ 
f 
: 
: 


whose reciprocal is, in Weierstrass’s form: 


J al (1 + 2am, 


II(z) n=l 


C being Euler’s constant. 
The generalization of II(z) referred to is due to Cesaro’. It is given by 


Di 
Tya,)(z) = lim F F . 
no 1 2 n 
(10) (1+ epi ter) (1+27) 


“ Dy — Dz 
(< gt  & ? ds ’ eee ) e 
Its reciprocal can be expressed in the form: 


1 Cid.) +Z Il ( ‘) —Zd,!/D. 
=e “nr 1+z—]je™"™" 
TI(a,) (2) nol D, 








i 
ef} 
; 
a | 
TF 
ii 





where Cia,) = lim ( = — log D.) is a constant with some of the elementary 
no \v=l1 v 


properties and application of C.° 
(10) shows that a, in (9) is subject to the condition 


(11) lim a,-D7** 7% = Thea, (¥) - Way (8) 
elie: TI (a,) (a) « H¢a,) (8) 


This fact will be used later (§3.1). 

2.2. Taking the series in §2.1, we can find out how they are affected by par- 
ticular rearrangements, supposing that in all of them appearing as examples, 
d,~k>O(n— @), 

Examptes. 1) Let >of (—1)" "an >? (-1)""/D,-log D,. Taking 
f(z) = x-log x in Theorem 1, we find g = 1/2k. Hence, by (7), 


s' —s= BS lim i dx _ = tim log (=?) 
2k pao de, t-logx 2k log Dog 

provided this limit exists. 

In particular, if 0 < lim (De,/Deq) < ©, there is no alternation in the sum 
of the series, due to the rearrangement. 

Next, if De» ~ Dbq (X > 1), the alteration is (log \)/2k. 

Finally, if D., ~ \”2* (\ > 1), the rearranged series diverges to ©. 

2) Let '? (—1)""a, = oF (—1)""/AD, + BD* (A, B > 0;1 > 4 > 0). 








‘The function is mentioned in E. T. Whittaker, Modern Analysis (Cambridge, 1902), 
195. The notation in Whittaker is, however, different from the one employed here. 
*E.g., Rajagopal [8], §3. 
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With f(z) = Ax + Bz", g = 1/2k so that 


Dep dz 1 r D:;" Dey + B B 
] 
i Az + Bo %AGQ—n) 06 Di,* +B 


s'’-—8 : lim 
~ Ok Piqro 





if this limit exists. 


Thus, if Dz» ~ bD», , the alteration in the sum of the series is (log b)/2kA. 








n-1 log Dy 
3) Let > (- a“, aim Sidi If f(z) = | age sails 
1 D, 2k 
; _ 1 (log ~~ 
‘ shia 2k pad | 2 Doq 
if the limit exists. 
Suppose that asp > q—> ~, 
Dep a + o(1) 
——- == |] + —_____, 
Dog + log Dog 


Then a simple calculation shows that the alteration in the sum of the series 
is a/2k. 


3 


These examples typify, in the order of their discussion, the three cases: 
lim,.,, f(z)/z = ©, 0 < lim f(x)/x < ©, lim f(x)/x = 0. The function f(z) 


which figures in them is such that Pe ji) can be readily evaluated. Under cer- 


tain conditions, it is possible to find s’ — s without evaluating the integral. 
The conditions are set forth below in three theorems which reflect the differences 
between the three types of restriction on f(z). 


THEorEM 2. Jf lim ——* ‘Dn _ g, then s’ — s = g lim log G) provided 
2q 


noo D, — D,-2 P.q7eo 





the limit exists. 
Proor. Take f(x) = z in (7) 


Corotuary. Setting D, = n, lin, hn --- successively, we obtain Theorems 
B and B’. 
THEOREM 3. If f(x)/x tends steadily to © with x, 
D,) Do» — D 
rip D,, are D,-2 = 9%, f(Deq) ( ) (p ¢ 


then s is unaltered by the rearrangement.® 





® It is obvious that Theorems 3, 4 and all deductions therefrom can be suitably restated 
in the case q > p. 
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Proor. Setting f(z) = x-y(x), we have ' 
Pe dz 1 Pop dx 
a -~ 0<60<1 
ve H@) oD + 0D — Duy} Jr, tS ) 
log 1 ae o(Ds)} | 
= f(Drq) _ 0(1) (q _ oo), 
¢{Deq + O(D2p — Drg)} | 
sO The desired result now follows from Theorem la. 
CoroLtary. The particular case D, = n contains Theorem C. 
More generally, we may take D, = ln, hn, --- 
Tuzorem'4. If, in Theorem 1, f(x) satisfies the additional condition 
lim f’(x) = 0, , 
then s’ — s = 0 or ag or © according as lim By — Obs .. Gernare. 1 


p>qro f(Drg) 
Proor. Since 


es Dap de _ Dep — Dag 0 <y<0<1) : 
Doq f(z) f(Deq) + 6(Dop rT Deg) f’ { Dog + n( Dep _ Deoq)} 0 , | 


the result sought is an immediate consequence of (7). 
Remark. To bring out the relation between Theorems 3 and 4, we need only 





Ss: 


r) vary the form of the hypotheses in the latter to the extent of replacing the . 
restriction on f’(x) by the conditions: f(x) tends steadily to © with x and f(x)/zx 

T= tends steadily to zero. These conditions imply that lim f’(z) = 0 and there- 

iL fore, our previous proof of Theorem 4 is still valid. 


CoroLtary 1. When D, = n, an = 1/f(n) in the special form of Theorem 4 
noticed above, we have Theorem A. 
Corotuary 2. Let D, = ln (x = 1), 


es 


ed 
f(x) = (bex)”*(hepra)*** «++ (hx) E(x) = (2) (say) 
where k = 1, the ms are positive or negative numbers, é(z) has a con- 
- tinuous derivative and is such that &(z) — ©, £’(x) = O(1/x) (t# > ~). Theo- 


rem 4 then gives the following equivalent of (38a) and (38b) in Pringsheim’s 
paper: 
If limn.,, n-lyn- ben - - - l,yn-Y(l,n)-dn = g (hn =n, 22 1), then s’ —s =0 


or ag/2 or « 
ag/2 or © according as lim WL 29) 
Corottary 3. Let D, = ln (x = 1), f(x) = 2" ”’-¥(x) (0 > 0). We have 
then the result: 
Tf limn,, bn-ln-ln +++ I n- (len) ?-W(Len)-dn = g (lon = n, x 2 1), then 
’ 1e(2p) — te(2q) 


s’’— s = 0 or ag/2 or ~ according as lim 
ag / ng as linn 29)"?-w(le29) 


= Ooraor ~. 











= Qoraor ~, 
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This is equivalent to (44) in Pringsheim’s paper. 

Notre. Theorem 4 thus includes two of the limiting cases discussed by 
Pringsheim: lim na, = © as in Corollary 1, lim na, = 0 as in Corollaries 2 and 3. 

3.1. I conclude with some applications of the theorems in §3. It is assumed 
in all these applications that d, ~ k as in §2.2. 

1) If the series >? (—1)""/D,-log D, is rearranged so that D., = 
O(De,:log Deg) (p > ¢ > ©), then its sum is unaffected by the rearrangement 
(Theorem 3). 

2) If the series }-? (—1)""/log D, is rearranged so that D,, — D,, ~ 
a-log Deg (p > q — @), then the alteration in its sum is a/2k (Theorem 4), 

3) Consider the series in (9), introducing the additional restriction d,, ~ k, 
(11) then gives 

lim et = 1 Daw) Man) , 
Dn — Dre 2K Wa, () Ha, (8) 


so that if0 < y +6—a-— B <1, a, is subject to the hypothesis in Theorem 4; 
with f(z) = 27? **: and if y + 6 — a — B = 1, an is asin Theorem 2. Hence 
we are led to formulate: 

Suppose that for the series in (9) dn ~ k. 

(i) Jf0<y+é6—a—B < land tf the series is rearranged so that Do, — Dog ~ 
aDj*** (p > q — ~), then the alteration in the sum of the series due to the 
rearrangement is 





@ Way (y)- Wa,» () 
2k Ta, (x) «1 ¢a,) (8) 
(ii) Ify +6 — a— B= 1 and Zf in the rearrangement Do, ~ bDe, (p > ¢ > ~), 
log b Wea,)(y) - Wea, (6) 
2k Wa,) (a) - Mya, (8) 
In the particular case d, = 1, we have: 
Suppose that the series 


(apt @ + Da + 2)-- (a + 0)-6 + YG + 2)---B+0) 

, (y + 1) + 2)-+-(y + )-(6 + 1) + 2)---(6 + 0) 
where a, 8, y, 6 are real and y, 6 # —1, —2,---, is rearranged (i) in the case 
0<y+é6-—a-—8 <1, 80 thatp—qw~aq””*” (p > q— &), (ii) in the 
casey + 6—a— B= 1, 80 that p ~ bq (p,q— ©). Then the alterations in the 
sum of the series in (i) and (ii) are respectively es e * eer +e and 
log bT(1 + y)-F(1 + 8) 

2 Tl+a)-T(1 + 8)’ 
This result in a somewhat less general form has been proved from first prin- 
ciples by B. C. Das [1]. 








then the alteration is 











Mapras CHRISTIAN COLLEGE. 
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ON MEAN ONE-VALENT FUNCTIONS 


By D. C. Spencer! 


(Received October 20, 1939) 


1. In several recent papers’ I have shown that many of the properties of 
p-valent functions are possessed by the wider family of functions which are 
p-valent on the average. The present paper is a study of the finer structure of 
functions one-valent on the average in which it is shown that such functions 
possess some of the more striking properties of schlicht functions. The method 
of the paper also yields two results of a more general nature which are stated in 
§’s (7) and (8) below (Theorems 3 and 4). The main results have been stated 
without proof elsewhere.’ 


2. For completeness we begin by restating the definition of mean p-valency. 
We suppose that f(z) is regular in | z| < 1, and that W is the Riemann domain 
which is the transform of |z| < 1 by f(z). Let W(R) be the area (multiply 
covered regions being counted multiply) of that portion of W which lies in the 
circle |w| < R. Then if 


(2.1) | W(R) < prR? 


for all R > 0, we say that f(z) is mean p-valent. 

It is convenient to express the inequality (2.1) analytically. Let n(r, w) be 
the number of times (necessarily bounded by some number depending upon r) 
that f(z) takes the value win |z| <r. We define 


(2.2) pr, R) = 2 [n(r, Re) dy; 
(2.3) n(w) = lim n(r, w); p(R) = p(i, R) = lim p(r, R). 
Then 


R T . 
W(R) = lim | [ n(r, Re)R dR dy 
r—1 /90 — 7 


(2.4) ri ; ; 
«{ (im 3 [mG Re iv) a(rk’) = | p(R) dle’, 


ral 2a 





1 Tam indebted to Mr. K. J. Arnold for making the drawing which is reproduced in Fig. 4 
below, on the original of which 679 values of the function K(z) = 42/(1 + 2)? were plotted. 

* For example, references (2) and (3) at the end of the paper. 

3 Spencer (5). 
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and so (2.1) may be written in the form ; 
(2.5) I : p(R) d(rR*) < prR? (R > 0). 

If we omit the averaging at (2.2), and suppose that 
(2.6) I * n(Re) d(wR?) < pak? 
for all R > 0, uniformly in y, then we say that f is strongly mean p-valent.* 


Fig. 1 represents a domain W which is strongly mean l-valent (a fortiori mean 
l-valent), but which is not schlicht. 


t 
- 
ei) 
| 
: 
4 
t 





Fie. 1 


3. It will be convenient to sum up here for future reference certain trivial 
deductions from mean 1-valency. 
Lemma 1. Suppose that 


f(z) = az + ae’ +--- 
is mean 1-valent, and let 
fle 


z 


d= Inf 

jz|<1 

Then d > 0, and n(w) = 1 if 0 S$ |w| <d. Furthermore, there exists a real 
number 8 such that n(de) = 0. 


. 














‘ Strong mean p-valency is therefore roughly midway in strength between p-valency and 
mean p-valency. 
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In fact, since f(0) = 0, there exists a circle of radius 6 > 0, and center w = 0, 
every point P of which is covered at least once by W. Since, by the definition 
of mean 1-valency, it is covered on the average at most once, it follows that the 
circle is covered exactly once by W. Hence d, the distance of the boundary 
of W from w = 0, satisfies d 2 6 > 0. 

Suppose now that 


(3.1) n(de®) > 0 


for every 6. To every 8, —t S 6 < 7, there then corresponds a circle of 
radius p = p(B) and center de” every point of which is covered at least once, 
and it follows from the Heine-Borel theorem that there is a number e > 0 such 
that n(w) = lfor|w|<d+e. But there is then at least one point w , | w | < 
d + e, for which 


(3.2) n(wo) > 1. 


Otherwise the intersection of W with the circle | w| < d + ¢ would be schlicht, 
and its boundary points in | w| < d + e would therefore be uncovered, contra- 
dicting (3.1). (3.2) implies that some neighborhood of wp is covered at least 
twice, and, since every point of the circle | w| < d + eis covered at least once, 
we see that 

d+e 


Wad+o=[ plR)d@R) > xd +0) 
0 
This is impossible by the hypothesis of mean 1-valency, and hence (3.1) is false. 


4. We come now to the first result: 
THEOREM 1. Suppose that 


f(z) = az + a+... 
1s mean l-valent. Then 


and equality occurs if, and only if, f(z) = az/(1 + yz)’, |y| = 1. 

We write z = re’*, w = f(z) = Re’*. Then Theorem 1 follows quite easily 
from the following lemma, which is a generalization of the Bierberbach-Faber 
Fléchensatz. 


Lemma 2. [If f(z) is mean 1-valent and if a > 0, then 
(4.2) / R“*d@ =0 (r < 1). 
jzl=r 
Lemma 2 when f is schlicht has been given by Prawitz,5 who has used 


it to prove (4.1) for schlicht functions. The deduction of (4.1) from Lemma 2 
is the same in our case, and I omit the calculations. 





5See reference (2) at the end of the paper. 
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Let : 
. We suppose that 0 < r < 1. 

: then, to prove Lemma 2 . 

be pop si of |z| < rbyf, B(r) its boundary (so go 

cen of |z| = r); and let d(r) (see Fig. 2)* be the distance ry) sexe 

ak the function f(rz) (with picture W(r)) is a fortiori mean 1-valent. 

w= 0. 


0, 
on 
he 


ry 


of 


th 


it, 


st 
e, 


e. 


ly 
er 





- int of W(r), 
ed and since f(0) = 0, d(r) > 0 by Lemma 1. Let P _ m i - “ae 
2 and let W’(r) (see Fig. 3) be the domain composed of the p 
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Then W’(r) contains the point w = ©, but is bounded (“internally”) by a closed 
(but not necessarily simple) curve B’(r) lying in the circle |w| < [d(r)}4**. 
Let W’(r, R) be the area (multiply covered areas being counted multiply) of 
that portion of W’(r) which lies in the circle | w| < R. Then 


(4.3) ; ies R“d® = rd “(r) — W'(r, d**(r)), 


and the lemma asserts that this is non-negative. 
We have, expressing (4.3) in terms of p(R), 


—ba 
lf 3a = | (1 = plr, RB?) d(aR?) 
2 jzl—=r 0 


pre Pa 2/ a 

= arf a - ve, RB 
and so, dropping the parameter r from our notation, 
(4.4) = 30 [ (1 — p(B) SE = edt — af ple) oe. 


Integrating by parts with respect to p(R), and observing that the integrated 
part vanishes at the limits d and ~, we have 


[joc BE = a +a) f° ([ p@an) 


Since, by Lemma 1, p(R) = 1 for R < d, we have 


[- p(R) dR = F p(R) dR — d. 


But, integrating by parts, 


[paar = ["rcor-@ = 1 [" parar + ["([ r@reae)F 


R, , 1 
at a4 = 
25 +5 4 "aR, Ri 


by the hypothesis of mean I-valency. Hence 


Ri Ry . 
p(k) ar = | p(R)dR -—d<R, —4d, 
d 0 








and so 


(4.5) [pte o 





: 


dR 1+a >; « lie 
=. * sta) | (a-a? = ter ar = 5 


Substituting from (4.5) into (4.4), we obtain 


; __Rtde 2 xd) — xd “() = 0, 
[2 =f 


and this proves Lemma 2. 
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By a suitable choice of a in Lemma 2 it can be shown (as Prawitz has pointed 
out in the case of schlicht functions) that 


|as| S 3.05 | a, |. 


I have not attempted to prove that | a;| < 3{|a,| (with equality only for the 
Koebe function), but it seems to me that this result must certainly be true 
(though possibly difficult to prove). 


5. The two most important properties of 1-valency (more generally of 
p-valency) are: (i) an invariance to one-valent transformations; and (ii) the 
metrical property that large areas cannot accumulate in small neighborhoods.? 
Of these mean 1-valency (more generally mean p-valency) possesses only the 
second; the group property (i) is lacking. In fact, mean p-valency is not even 
an invariant to translations, the simplest transformations of all. When f is 
1-valent (schlicht), it is possible by use of the property (i) to deduce immediately 
from Theorem 1 that 
(5.1) d=tla|, 
where d is the distance of the boundary of the map W from w = 0; but when 
fis only mean 1-valent the argument breaks down completely. The result is 
that I am able to prove (5.1) only for strongly mean 1-valent functions, and 
the precise inequality for mean l-valent functions remains undetermined. 
However, under mean 1-valency the weaker result is true that d = |a,|/A, 
where A is an absolute constant less than 7; and more detailed computations 
indicate (but in their present form do not prove) that the result is still true when 
A is only slightly greater than 4. A discussion of this problem is to be found 
at the end of the paper. 


6. We now take up the result (5.1) for strongly mean 1-valent functions. 
Afterwards we state and prove the two general theorems mentioned in §1 
(Theorems 3 and 4 below), the hypotheses of which are essentially the conclusion 
of Lemma 1 (here only a vestige of mean I-valency appears). 

THrorEM 2. If f(z) is strongly mean 1-valent, then 

d24\|a| 
with equality if, and only if, f(z) = az/(1 + yz)’, |y| = 1. 
The following result is almost an immediate consequence of Theorem 2: 
Corotuary. If f(z) is strongly mean 1-valent, then 


lf@|= 
and there is equality if, and only if, 
f(z) = az/(1 + yz)’, ly| = 1. 
"It is the distribution of the area of the map with respect to w = 0 which determines 


the magnitude of the mean values of | f | and | f’ | (and so of the upper bounds of the co- 
efficients). 


| ai|-|2| 


(1 + |2])” 
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In fact, more generally, the inequality 


4 |a|-|z| 
2S ee eee 
| f(z) | = A (i+ |z|)? 
is a consequence of d = |a:|/A for any f. I omit a proof of this statement 
since the proof is substantially the same as in the well-known case when f is 
schlicht (see R. Nevanlinna, Eindentige analytische Funktionen, Berlin, 1936). 
In the sequel we shall make repeated use of the following lemma. Suppose that 


g(z) = biz + bee? + --- 


is regular in |z| < 1. Let G be the map of the unit circle by g, and let G* be 

the “star” of G with respect to the point g(0).8 We write Z* = g(G*), and 

denote by pi (R), p2(R) the valency functions of G* and Z* respectively? 
LEMMA 3. 


6.1) ig hi < [le Rat—pi@y) + [le Ral—pt ce) 


unless g = byz. 

(6.1), in different notation, is an inequality of Bermant.° 

The lemma remains true if g is regular only in Z*, but ts false in general if this 
condition is not satisfied. 

To prove Theorem 2 we suppose that d = 1, and prove that then | a | < 4, 
a statement which is plainly equivalent to the theorem. By lemma 1] there 
exists a 8 such that f ¥ e” in|z| <1. We write 


K3(z) = 4ez/(1 + 2)’. 


This function (Koebe’s function) maps the circle | z | < 1 on the plane slit along 
a straight line from e” to ©, and the inverse function K;' (w) has branch points 
of order 2 at w = e” and at ~. Since f ¥ ec’, «©, however, either branch of 
Kj is regular and one-valued on W, the map of |z| < 1 by f. To fix ideas we 
suppose that Kj'(w) is the continuation over W of the branch which vanishes 
at w = 0," and write 


gz) = Ke '\f@} =bz+---, 


where |b; | = | a, |/4. By Lemma 3 we have 


62) g's [ig rat—piie + [ig Ral—pt cay, 





® More precisely, G* is the subdomain of G any point of which can be reached by starting 
at g(0) and travelling outward along some radius vector. G* does not depend on the par- 
ticular way in which the sheets of G are supposed stuck together. 

* p* (R), for example, is therefore the average valency of G* on the circumference | 9 |= 
R. Both G* and Z* are schlicht, and so p* and p¥ are bounded above by unity. 

” For a proof of (6.1) in the above form see Spencer (5). 

4 We might equally well take the other branch (which is infinite at the origin). 
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and it is sufficient to show that 


(6.3) l lg Rd[—p:(R)] <0 


(4) [ eRa-pi@) s 


(6.4) is immediate. For Z* is a schlicht domain which contains the point 
z = 0, and lies inside |z| <1. Therefore by a known inequality” we have 


exp if Ig reat —picei} < [ R’ d{[—p:(R)] = : r p2(R) d(rR’) < 1 


and this is equivalent to (6.4). 
As for (6.3), we prove the following stronger result that 








(6.5) [ \¢ Ral—p(R)] s 0, 
0 
where p;(R) is the valency function of G. 
Let €(Ro) be the set of points in |z| < 1 at which |g| = | Ks {f}| > Ro, 
and write 
I f'(re’*) |? 
A(8, Ro) = If ig Ig g edeilp = if ee) = 8] 1FGe*) pr arde. 


€(Ro) 


We suppose that Ro is small enough to Liiny any condition imposed by the 
argument. Then p,(R) = 1 for R S Ry by Lemma 1, and since p,(©) = 0 we 
have, integrating by parts, 


2x | lg Ral—p.(R)] = 2m [Ig Ral—p.(R)] 
(6.6) 0 Ro 
= | plR) dr ig R) — 2n Ig 2 = AGB, Re) - Peg 2. 
Ro 0 0 


Next, 


Uf ae ad ae AE pt per ere 








C > C 4Ro+CR* . 

<i [ [ if’ Prdrd, = al. “ogg POR) ACR) 
: art ’ 

C 4Rot+ CR} 


d(rR*) = O(Ro), 


R? 4Ro—CR} 


by Lemma 1 and the hypothesis that d = 1; and hence 


(62) A(8, Ra) = 3 pit Fp Tarde + OCR) 


if|>4Ro 





Se et canenecemeiiagiansslicieinets 
"* See Spencer (5). 
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But 
Iv’ P f. a RdRd 
[ [ T= rp = diay Le a Re® |B? 
6.8) a [ f n(Re'*) dR d® 
“tte c= R-~V/1 — 2R cos ( — 6) + R? 
oo 1(@+8) 

shi! - | Se ee 
Wt 4no ++ R~/1 — 2Rcos@ + R? 
1) Substituting from (6.8) into (6.7) we obtain the formula (valid for all sufficiently 

small Ro): 


n(Re\**®) 
| fade ee eee 
Integrating by parts we have 


dR dé + O(Ro). 








(6.9)  A(B, Ro) = 














| dR = * ap | 
ot sro R-+/1 — 2R cos ® + R? 1/1 — 2R cos® + R? 44x & 

_ aa Ge oe dR 
| Loa ~ / dR \4/1 — 2R cos & + FR? 


ieee 
4 . any \Y4no dR \+/1 — 2R cos ® + R? 
"ii! ‘ R — cos ® 

Jo ( sr B an) (1 — 2R cos + R?)3? - 


ot dott R — cos® 
n dR, 
*. I ( ar, dn) (1 — 2R cos @ + R?)” 


since (by hypothesis) d = 1 and so, by Lemma 1, n = 1 for0 S$ R <1. We 
write 


, aes R — cos® 
a = | 
= ae Lah " p28) (1 — BR cos ® + RH 


Then I(R , &) depends only on Ry and 4, but not on f. Next, writing 














—_ 











R i(®+8) 
(6.12) nRjet+e=[ “ap, 
4Ro R = 
we have 
= 1 R R R R dR, 
nia ~ 6) " li L. nab |. ” S (J, az) Ri 
1 R 


di _ jg B 
< 7 (R— 4B) +f (Ry — 4Ro) Se = le gp, 
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(6.1 
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py the hypothesis of strong mean 1-valency and the fact that n = 1if R < 1 
(since d = 1). Hence 
* dR R 

< ~ eA 
(6.13) N(R, ® + 8) < I Ene 
uniformly in ®. Since 

R — cos ® 
(R? — 2R cos ® + 1)*/ 

if R > 1, we have, by (6.10), (6.11), (6.12), and (6.13), that 


ro) i(®+8) r-) 
(6.14) [ enn att dR < . 

sno R-+/1 — 2R cos®@ + R? 4ro R-+/1 — 2R cos ® + R? 
Integrating both sides of (6.14) with respect to from —z to z, we obtain the 
inequality 





>0 


























rr pe n(Re*?*”) 
A(B, Ro) = : dRdé + O(R 
(8, Re) se aca 
Tr oo 1 
< dRdé + O 
_ pi [, Jon R.V/1 — 2R cos ® + R? ne 


1 


z 





r 1 2 
= [| |E| rarde + olf) = 2n lg = + OC), 
x /Ro Ro 
since when n is replaced by unity, we may suppose that f = Kg, and sog = z. 
Finally, substituting from (6.15) into (6.6), we have that 





[15 Ral—pie)) 5 OCR). 


Letting Ry) + 0, we obtain (6.5), and this completes the proof of Theorem 2 apart 
from the statement concerning equality. For this we note that equality implies 
by Lemma 3 that g = z, therefore that f = Kg. 


7. Suppose that f(0) = 0. By Lemma 1 mean 1-valency then implies: (i) 
that n(0) = 1; and (ii) that n(de®) = 0 for at least one 8. We now take for 
hypotheses only (i) and (ii), and obtain the following theorem: 

THEOREM 3. Suppose that 


f(z) = azet+ dee’ + ow 
is regular for |z| < 1, that n(O) = 1, and that 








d= Inf fe) . 
jzj<1! 2 
Then if there is a 8 for which n(de®) = 0, we have 
(7.1) a>+jal. 


13 


SS aes 








as 
| 
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The constant 13 is not the best possible. As a matter of fact, a better (and 
in a sense best possible) constant is obtainable by a “subordination” argument 
(Lindelof’s principle). But the theorem is only mildly interesting on its own 
account, and is included here mainly in order to deduce Theorem 4 beloy. 
The hypothesis that n(de") = 0 for some 8 is necessary. This is seen by con- 
sidering the function 


f(z) = 2 €s 


for which n(de®) = n(0) = 1. . 

Since by hypothesis n(0) = 1, n(de”) = 0, we haved > 0. We suppose that 
d = 1, and prove that | a,| < 13. Then if 6 is a number for which n(e”) = 0, 
the function” g = Kz {f} is regular for |z| <1. By Lemma 3 we have, 
therefore, 


(7.2) ie 1! < [ier al—pteyl+ [le R dl—pi ce) 


< lg Rdl[—p*{R)] = J*, say, 


where pr (R) is the valency function of G*. We find an upper bound for J*. 
Now every point of the circle | w | < 1 is covered exactly once. For n(0) = 1, 
and if some point w, | wo | < 1, were covered n-times, where n # 1, then the 
projection of a boundary point would fall on the straight line connecting 0 with 
Wy ; and this is impossible since d = 1. 
We define 


K,{Glargz = 9,0S|z|< o]}, oe #0," 
K,g{Glarg z = 9, 0 S |z| < 1]}, g=0,7 


where @[---] is the set of points defined by the conditions stated in the bracket. 
Positions lying outside |w| < 1 of the lines t which cut |w| = 1 at intervals 
of 5° from 0° to 180° are shown in Fig. 4. We denote by Ws the subdomain 
of W any point P of which can be reached by starting at w = 0 and travelling 
outward along some line ts(y), ¢ = o(P), always in the sense of increasing | 2 |. 


Then G* = Kj'{Ws}. Also if we let nj(w) be the number of times Wg covers 
the point w, then 


(7.3) ts(y) = | 








(7.4) ng(w) <2 
for all w. 
Next, if Ro is sufficiently small, we have™ 
© pe * (Dy i( +8) 
7.5) ae+= ff np ee) aR ab — ax ig + + 018). 
429 7 R-v/1 — 2R cos + R? Ro 


That is to say, 2rJ* is the difference between the logarithmic area of G* and 








18 The branch of K;' which vanishes at w = 0 is again chosen. 
14 By an argument analogous to the one given in §6. 
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that of the unit circle |g| <1. If we combine (7.4) with the fact that ng (w) = 
n(w) = 1 for |w| < 1, we see that 
(7.6) Jest, 
where Jo is obtained by taking 
. 1,if|w| <1 
ng (w) = ‘ 
2,if|w|>1 


in (7.5) 
Finally, we may express J 0 in the form 


1 29 
i a = | udv(t), 
2 Jo 


where 
u = lg {1 — 2v/sin it (cos #t + sin ft) + 2 sin 3¢}. 
2s [Vv sin 4¢ (cos }t — sin } \ 


v=t—2tan ————— 
1 — V/sin 3 (cos #t + sin }) 





A computation gives 
(7.7) 1.1541 < Jo < 1.1784, 
and substituting the upper bound from (7.7) into (7.2), we get (7.1). 
8. By combining Theorem 3 above with Theorem 3 of my paper (4), we 
obtain the following theorem: 


THEOREM 4. Suppose that f(z) satisfies the hypotheses of Theorem 3, and that, 
in addition, | f(z)| < Min|z| <1. Then, if 0 < p <1, 


(8.1) |argf| = Blg {1/(1 — p)} + O(1), 
where B = : lg (13M /| a; |). 


The constant in the O(1) depends on M, | a, |, and the particular determina- 
tion of arg f chosen. 


We write 
fi) = ett*, 
By Theorem 3 and the hypothesis that | f | < M, we have 
(8.2) Ig (| a1 |/13) S u S lg M, 


if|v| >. We cut W (the map of |z| < 1 by f) from w = 0 along a radius 
to a boundary point de”, n(de) = 0. We denote the resulting domain by W’, 
and write Z’ = f '{W’}. The function 


k= ia cae er 











18 We shall refer to this paper as V2. 
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is regular in Z’, and we apply the method of V2 to the function h. By (8.2) we 


have that 
| a: | 


ome Sak E < ; 
lg i3 = largh| S lg M 


Hence, in the notation of Ve, 


(8.3) OA(R) = Rig ncaa 

| a1 | 
uniformly in », if R > Ry = e”*. Substituting from (8.3) into Theorem 3 of 
V,, we obtain (8.1). 

Generalizations of Theorem 4 are possible. Condition (8.2) is much too 
strong; all that we require is that it is satisfied on the average. For example, 
in Theorem 4 it would be sufficient to suppose that | f |, gua function of arg f, 
is bounded when averaged with respect to arg f (as opposed to ordinary mean 
valency, which implies that | arg f| is bounded when averaged with respect 


to | f|). 


9, Suppose that the projection of a Riemann domain W lies inside a schlicht 
domain D in the space of the variable w, and that 7’ is regular and schlicht in D. 
We write 


w=Re* = Tw), nw’) =n(w), p(R) = = f n'(R'e'®’) de’. 
2r T 


The lines R’ = constant, 6’ = constant, define in D a new codrdinate system. 
We shall say that W is mean p-valent (7’) if for all R’ > 0, 


R’ 
| p'(R’) d(wR”) < prR”. 
0 
Suppose now that 


f(z) = az + ae’ +... 


is regular in |z| < 1, that f ¥ e®in|z| <1 (ie., n(e®) = 0), and define Wz 
as in §7. We cut through all sheets of Ws along a radial line from e” to ~, 
and denote the resulting set of points by Ws . If there exists a B, f ¥ e” in 
\z| < 1, for which Y; is mean 1-valent (Kj'), we say briefly that W is metri- 
cally subordinate to K. The following result is true: 

If f is metrically subordinate to K, then 


(9.1) |a,| S$ 4, 


with equality only if f = Ke. 
In fact, let 


G* = Ky'{Wa};  @* = Kp {Qs}. 
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Then if p;(R), pi (R) are the valency functions of G*, G* respectively, we have 


(9.2) o*(R) = pi(R) + pt (Z): fOSR<1, 
0, ifR > 1. 
Furthermore, by hypothesis, 
(9.3) [vt@ aR < | fR<1 
: 1, ifR>1. 


In particular, therefore, by Lemma 1, pr (R) = 1if0 S R < Ro, and go (in- 
tegrating by parts) 





[ era ote) = writ fot ace 
ok 2 f * 2 \e 
gs) =e Ri+ 2 lf. pi'(R) a(R’) 
= Ig Rj + 2 [ 3 [ piace} +2 [ if. i pi (R) are) a 
=gkR+ht+tis, say, 
where by (9.3) 
(9.5) h=2f if pi (R) d(R’) - ni —IgRi+R-1 
(9.6) nse2f apy @ai— x. 


Using (9.2) and substituting from (9.5) and (9.6) into (9.4), we obtain that 


y= [ ig Ral—- pie) = [ wea pien+ [eR a— pte) 
0 0 1 


1 1 o 
= [ erat piel + [eral -pi(t)]=4[ wea alot so 
0 0 0 
and this is equivalent, by Lemma 3, to (9.1). 
10. The question now arises as to whether the result of Theorem 2 is true for 


mean 1-valent functions. I am not able to answer this question, the difficulty 
being as follows. 
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Let F; be the family of mean 1-valent functions, and let 


J, = Max | ” Ig Rdl—p.(R)], 


fer, 


Jt = Max | Ig Ra{—pi(R)), 
fer, “0 

where p:(R), pi (R) are the valency functions of G and G* respectively (so that 

jt = Max J*). Then 


(10.1) O< Ji < J. 


In fact, let W be the domain represented in Fig. 5, when n(w) = 1 for almost 
all w exterior to D, and D., and n(w) = 2 for we D,, n(w) = O for we De. 
The solid curved lines are (we suppose) lines &(¢). The boundary of W consists 
of the boundaries of D, and D2, and slits. Let D,(R), D.(R) be the widths of 
D, and D, on|w| = R. Wesuppose: (i) that D,(R) = D.(R) for all R; (ii) that 








Fia. 5 


all of W is swept out by lines &(y) as ¢ varies from 0 to 2x—that is to say, 
W=Ws. Both assumptions may plainly be fulfilled if D, , D. have suitable 
shapes, and (i) implies that p(R) = 1 for all R > 0, so that W is mean 1-valent. 
Now write w = Re'*, dW, = RdRd®, and let dW, be the transform of dW. 
by Ks'(w). Then if ® is the distance from the element dW,, to the point e”, 
we have by (6.8) with 8 = 0 that 


dW, = dW.,/A, say, 


WhereA = R°R. By sliding points w = Re from one of the two sheets covering 
D, along the circumference | w | = R into Dy, we can just fill the latter domain, 
and still have D, covered once. For the resulting picture J* = 0. Since 
A(Re™) is a decreasing function of @ as we slide w along the circumference 
| = R from D, into D, , we see that the original J* corresponding to Fig. 5 
must be positive; and (since J* < J;) this proves the first half of (10.1). 
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As for the second half, we may plainly suppose (by a trivial rotation of each 
function f) that 8 = 0. Then it is only necessary to point out that an argument 
similar to the above shows that J; is the area which results from the choice 









































‘1, OSR<1 
‘ 4 *@>1.. =. e823 
(10.2) nm (Re*®) =< ' 2  E. 
. cae on 
9 & > 1, 5 2 @< 5 
A\. 
yA 5 
A, 
4 
C 
3 
C, 
i. : 
| 
te] 
-2 =| oO 
Fia. 6 
In Fig. 6, C; is the transform of | w| = 1, C2 the transform of any w = 31/2 


by lg Ko'(w), and J; = : (A; — Az). For any function n which satisfies mean 


1-valency and differs ies n; in a set of positive plane measure, the corresponding 
J is less than J,. In particular, the function n* corresponding to any Wo 





ee re 
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plainly differs from m in a set of positive plane measure ;° and this proves the 


second half of (10.1). 
Thus the method of this paper fails to yield the result 


for the family F'; of mean 1-valent functions. 

It is perhaps worthwhile to point out that the area integral J (also J*) is a 
function of the parameter 8, and that J() exist for all 8 whether n(e”) = 0 
or not. This can be seen from the formula (6.9). Furthermore, 

1 Tr 

— <= 
(10.4) = } J (8) dg < 0 
if feF,. In fact, the integration has the effect of replacing n by p in the 
formulas of §6, and the argument given there (with strong mean 1-valency re- 
placed by mean 1-valency) yields (10.4). From (10.4) we see that we can 
choose a8, 8 say, for which 


(10.5) J (Bo) = 0. 


But we cannot deduce (10.3) from (10.5). The difficulty is that for this choice 
of 8 the function g = Kg {f} is not necessarily regular in Z*, and we cannot, 
therefore, apply Lemma 3. 

On the positive side it is true, of course, that 


(10.6) d = |a|/A, 


where A is an absolute constant. By Lemma | we see that a mean 1-valent f 
satisfies the hypotheses of Theorem 3, and therefore that A < 13. Buta rather 
crude computation of J; shows that (10.6) is true with A < 7. Finally the in- 
equality (10.1) indicates that the determination of Jj would still further reduce 
A. Although I have not attempted to determine J 4 , I should like to conclude 
the paper by pointing out certain qualitative considerations which indicate 
(though of course they do not prove) that J; (though positive) is small, and that 
(10.6) is therefore probably true with A = 4. 

We suppose again (without loss of generality) that @ = 0. The two lines 
4(¢), b(y’) which cut the circumference | w| = 1 at & — 1° and at & respec- 
tively, = 1°, together with a portion of | w| = 1 and the real axis, bound a 
certain region of the w-plane which we shall denote by A(®). We denote by 
a(®) the area of the transform of A(®) by lg Ko'(w). The function a(), labelled 
“area per degree’, is plotted against ® (in degrees) in Fig. 7. We note the fol- 
lowing two facts: (i) a(@) is an increasing function of ; and (ii) a(#) varies 
SS 


a n* is the “‘star’’ function derived from n; , and J*(n*) the corresponding ‘“J*’’, it 
8 not difficult to show that J*(n*) <0. If Af is the portion of A; which can be seen 


1 
from — ©, J*(n*) = "= {2AT — (A1 + As)}. 
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remarkably little, the variation being only a quarter (approximately) of its 
minimum. 

Now the domain W¢ is limited to a considerable extent by the shape of the 
lines t). Either We is schlicht, in which case J* S 0; or We covers some region 
D of the w-plane twice, in which case we suppose that D lies inside the region 
A(&) and touches the two boundary lines &(¢), f(y’) of A@). Then every point 
of A(®) (and its reflection in the real axis) is covered at least once. If now 4 is 
small, then the resulting contribution to J* is below average by (i); if © is large, 
the contribution is relatively large, but mean 1-valency prevents too many 





022 





en ee. 
021 Feet 


pel Area_per Degree 











< 





r Degree — 


. 
q 





+7 


O17 



































° 25 50 75 100 125 150 . 175 
Degrees 


Fic. 7 


D’s from having this location, as can be seen from Fig. 4. In other words, mean 
1-valency tends to prevent the accumulation of area in A’s with large a’s, and 
this is an indication that J 7 is small. 

As an example, let us consider the class C of functions f of F; for which Wo has 
the property that either no point of a region A(®) is covered, or every point of it 
is covered twice. Then by mean 1-valency at most 180 regions A(®) are occupied 
(since each must be filled twice). If we fill twice the regions A(@), @ = 90° + 2, 
v = 1°, 2°, .-- , 180°, a precise computation shows that (10.6) is then true for 
this function with A = 4.35; and by property (i) above we see at once that (10.6) 
with A = 4.35 is true throughout the class C. For this result we have used mean 
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j-valency only on the circumference | w| = 1. If we use it everywhere, we find 
that (10.6) is true in C with A = 4, 


11. In conclusion I should like to point out that results cbtained for mean 
l-valent functions may be immediately transferred to mean p-valent functions 
of the form 


(11.1) f(z) = Dane", 
In fact, the function 
filz) = {f(@)}"" 


is regular in |z| < 1 since the mean p-valency of f implies that f = 0 only 
atz = 0. Let m(w) be the number of times f, takes the value w in |z| < 1, 
and let p:(R) be defined by (2.2) and (2.3) in terms of n,. Then if p(R) is the 
corresponding function for f, it is easy to see that 


(11.2) pi(R) = 7 PR”), 


and so 


R gl ig R f : 
= I (BR) dak") = © I p(R”) d(wR?) 


Le. 
ry sf p(R) d(nR*) < —.peR’ = rR? 
P 0 


sie 


by Lemma 1 of my paper (4). Hence f; is mean 1-valent, and so we have that 
ldp1| S 2p-|a,|, d = Inf | f(z)/2? | > | ap |/7’, ete. 
lzl< 
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NECESSARY CONDITIONS IN THE THEORY OF INTERPOLATION 
IN THE COMPLEX DOMAIN 


J. H. Curtiss 
(Received May 18, 1940) 
1. Introduction 


Let C be a closed limited point set of the complex z-plane, and let A¢ denote 
the class of functions analytic and single-valued on C. Let 


ee 
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mg 
be an infinite set of points of the finite plane which either lie on C or have no 
limit point except on C. (The points of M need not be all distinct.) A function 
f(z) € Ac will be analytic and single-valued at each of the points a,”, k = 


1,---,n, if n is sufficiently large, say n > n(f). There exists a unique poly- 
nomial L,,(z; f) of degree at most n — 1, n > n(f), found by interpolation to the 
function f(z) in the points at”, k = 1,---,n. 


Certain conditions are easily proved to be sufficient for the convergence of 
the sequence {L,(z; f)} to f(z) on C. The necessity of such conditions seems 
first to have been studied by Kalmar’ in his thesis of 1926. He proved the 
following theorem: 

TuroreM A. Let C be a closed Jordan region and let the points M lie on the 
boundary B of C. Then a necessary condition that lim L,(z;f) = f(z) uniformly 


on C for each function f(z) «Ac is that the points M be uniformly-densely dis- 
tributed on B. 

The sufficiency of the condition was proved earlier by Fejér.’ A uniformly- 
dense distribution of the points M on the Jordan curve B is defined as follows: 
Let the analytic function w = ¢g(z) map the complement of C (with respect to 
the extended plane) onto the region |w| > 1 so that the points at infinity 





1 See J. L. Walsh, Interpolation and Approximation by Rational Functions in the Complex 
Domain, New York, 1935, pp. 49-51. 

2L. Kalmér, Uber Interpolation. (In Hungarian.) Matematikai és Physikai Lapok, 
1926, pp. 120-149. 

°L. Fejér, Interpolation und konforme Abbildung, Géttinger Nachrichten, 1918, pp. 
319-331. 
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correspond to each other. The mapping is bi-uniform and continuous for 
\w| 21. The points M are said to be uniformly-densely distributed on B if 
the points g(as”), kK = 1,-++,n, m = 1, 2,---, are uniformly-densely dis- 
tributed on the unit circle in the sense of Wey] A that is, if for any arc of length 1 
on the unit circle, 


nS: ay Ne a 


no Nl k 24 
g(ax”) C I. 


Kalm4r’ also proved the following theorem in the case that the region K is 
simply connected: 

TuzorEM B. Let the complement K of C be connected and regular in the sense 
that K possesses a Green’s function G(x, y) with pole at infinity. Let g(z) = 
°° where H = H(x, y) is a harmonic conjugate of G, and let A be the transfinite 
diameter (or capacity) of C. Then a necessary condition that lim L,(z;f) = f(z) 


uniformly on C, for each function f(z) € Ac, is that 


n 


lim | wn(z) |"" = Al e(e)|, wn(z) = I (2 — aj”), 
uniformly for z on any closed limited point set interior to K. 

The sufficiency of the condition was also proved by Kalmar in the case that K 
is simply connected. The formulation of Theorem B given here, and the proof 
of the theorem and its converse in the more general case, are due to Walsh,° 
who has also generalized Theorem A and its converse to the case of any finite 
number of mutually exterior closed Jordan regions.’ 

In the proof of these theorems, Kalmar and Walsh make use of the uniform 
convergence on C' of the sequence {Z,,(z;f)} only for functions f(z) belonging to a 
certain subclass AG of Ac. This subclass A’, is the class of all functions of the 
type f(z) = 1/(t — z), where the parameter ¢ assumes the affix of every point 
in K. It is our main purpose in this paper to show that the conclusions of 
Theorem A and Theorem B (with certain restrictions on the set C in the latter 
case) can be derived from an initial assumption which is much weaker than the 
condition of uniform convergence of the sequence {L,(z; f)} for each f(z) ¢ Pe 
In fact, our hypothesis will be merely that at a single interior point 2 of C, 
the following inequality holds: 


aponte / 
(1.1) lim |Ln(eo;f)\"" $1, fe)eAc. 
nc 

ee eee 

‘ . Weyl, Uber die Gleichverteilung von Zahlen mod Eins, Math. Ann., vol. 77 (1916), 
pp. 313-352. 

*Kalmér, loc. cit. 

"Walsh, op. cit., pp. 154-155, 159-162. 
: ‘Walsh, op. cit., pp. 168-170. See also G. Szegé, Bemerkungen zu einer Arbeit von 
errn M. Fekete, Math. Zeit., vol. 21 (1924), pp. 203-208. 
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In the course of our discussion, we shall derive several auxiliary results which 
appear to be of some interest in themselves, and which will accordingly be de- 
veloped in some detail They include several new non-trivial sufficient condi- 
tions for the convergence of the sequence {L,(z; f)} (Theorems 2.2, 4.1, 5.1, 5.2), 
an inequality which promises to be useful in the study of the fundamental poly- 
nomials w,(z) = [2 (2 — at”) (§3), and a new sufficient condition that points 
be uniformly-densely distributed on a Jordan curve (Theorem 5.1). In con- 
nection with the inequality, we shall digress briefly to discuss a recent paper by 
Lammel® (§4). 


2. Preliminary Theorems 


We shall show in this section that condition (1.1), without the restriction that 
z be an interior point of C, implies the convergence of the sequence {L,(z ; f)} 
for any f(z) «Ac. 

THEOREM 2.1. Let C be an arbitrary closed limited point set whose complement 
K is connected. Let the limit points of M all lie on C. Let 2 be any point of C 
and let K’ be an arbitrary closed limited point set interior to K. If 











(2.1) lim |Lalzo;1/(t—a))["S1, teK, 
then 
1/n 
(2.2) wn(20) Se<i, “tet 
no wn(t) 


where p is independent of t. 
The proof is indirect. For a fixed t, if n is sufficiently large, we may write | 








(2.3) bse teats [1 ie ols) |, amt. 
t—2z wn(t) 
Suppose now that for some & e K, 
seen 1/n 
(2.4) fk (| iw ge: 
no ‘wn (to) 








If uy > 1, we obtain a contradiction at once by reference to (2.1) and (2.3). 
But in any case we may proceed as follows: 

The functions (log | w,(é) |)/n form a normal family of harmonic functions in 
any closed limited sub-region of K, provided that n is sufficiently large, because 
in that case |t — a” |, k = 1, --- , n, is not only uniformly bounded, but also 
uniformly bounded from zero. Since (log | w,(é) |)/n — © as t > ©, no limit 
function of the family can be identically constant in K. Let the subscript ’ 
run through an infinite subsequence of the positive integers such that if n in (2.4) 





* E. Lammel, Uber Approximation regulirer Funktionen eines komplexen Argumentes 
durch rationale Funktionen, Math. Zeit., vol. 46 (1940), pp. 104-116. 
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is replaced by n’, the symbol lim can be replaced by the symbol lim. There 
exists a limit function W(é) defined by the equation 
ies n’? 
log W(t) = lim er te K,t ¥ ow, 
where n’’ assumes an infinite subsequence of the values assumed by n’. The 
Principle of the Maximum for harmonic functions, applied to log W(t), now 


shows that there exists a point t, eK such that W(t) < W(t). We have 


wn’ (Z0) Wn’? (Zo) wnr?(to) ~sal =p W (to) 
wnrr(th) Wnr'(to)conr(th) ° W(h) 


Thus there exists a number yp; > 1 such that for n” > n(u:), we can write the 
inequality 


ljn’? 




















lim > §. 


n''—00 n'’'—co 














wn’ (Zo) nit 
= . 
wnrr(th) = M1 
Substituting into (2.3), we find that 
nt 1 din’? n'? n't 
| Lnvlzo; 1/(t — 20)] |" = row (ur — 1)", n” > n(w), 





which contradicts (2.1). Thus for each ¢ in K the first member of (2.2) is less 
than unity. The fact that it is bounded from unity for ¢ eK’ now follows from 
the observation that if n is sufficiently large, the functions whose superior limit 
appears in (2.2) form a bounded equicontinuous family for ¢ e K’, so the superior 
limit is a continuous function of t on K’. 

THEoREM 2.2. Let C be an arbitrary closed limited point set whose complement 
K is connected. Let the limit points of M all lie on C. A necessary and sufficient 
condition that 


(2.5)  Ln(eo sf) — flo) = Or"), r<l, meC, flzjeAc, 
where r depends on f(z), is that 
lim | Zalzo;1/(¢ — 2] (""S 1, teXK. 

The necessity is obvious from (2.3). In the sufficiency, we use the fact that 
given a function f(z) e Ac , there exists a point set S consisting of a finite number 
of regions, each bounded by a finite number of non-intersecting rectifiable 
curves, such that S contains C in its interior and such that f(z) is analytic and 
single-valued on the corresponding closed set S.’ Let the set K’ of Theorem 2.1 
be the boundary of S. The Cauchy-Hermite formula for L,(z; f) can be set up 


as follows: 
L,(zo;f) = § / SO. [1 = i dt, n>nf), 


2mi dei t —z wn(t) 
and the remainder of the proof follows at once from Theorem 2.1. 


m_— 





* Walsh, op. cit., pp. 12-13. 
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We add three remarks concerning these theorems. 

(a) It is not difficult to devise sequences M for which (2.1) obviously holds, 
If the points a”, k = 1, --- , n, are assumed to be distinct, we have the familiar 
formula 


wn(Zo) 
(zo — af” )wn(af”) 


Thus a sufficient condition that the sequence M should satisfy (2.1) is that 





Liles; f) = D Stes”) 





sonlte) (n) s Xn; k = x, eee n, lim xi s L. 
(Zo — & Jon (ax ) n—0 


Hee (2.6) 


Sequences M which satisfy (2.6) for every point 2 ¢ C are easily shown to exist, 
and have been studied by a number of authors; an example of such a sequence M 
: is given by the ‘Fekete points.’”” 
(b) If instead of being valid merely at a single point 2, (2.1) or (2.2) holds 
: at each point z of a closed subset C’ of C containing no limit points of M, then 
) the functions whose superior limit appears in (2.2) form (for 7 sufficiently large) 
' Wie a bounded equicontinuous family in the two variables z and t, z¢C’, te K’. 
| Accordingly, in this case the number p in (2.2) may be chosen so as to be inde- 
pendent of both z and t, z eC’, t e K’, and (2.5) holds uniformly for z ¢ C’. 
i id: (ec) The proof of Theorem 2.1 does not apply to the case in which K is not 
Het | connected, because then it may happen that the limit function W(¢) used in the 
i ate proof is a constant. In fact, both Theorems 2.1 and 2.2 are false in this case, 
righ i as may be seen by letting C be a circular annulus whose outer boundary is the 
unit circle, and letting the points a4”, k = 1, ---,n be the n-th roots of unity. 


: | 3. An inequality 





| Lemma 3.1. Let ai, d2,--+,@n be n real numbers such that qn S % SJ 1, 
k=1,---,n, where0<4»<1,0<q<1/n. Then 
q(a1, @2, +++, Qn) = I] (ax + 9)" — TI a” 
(= k=1 
(3.1) 1; n +92 1/q+)) 
< 





« log q — log (q + 1) — log (y + 1) 2 < i/(q+ 1) 
(ow) og 9 + log (@ + 1) + log @ 41)’ Bae ais * * 








t 


where 
< = 108 log an — log [log n + log (g + 1) + log (n + 1)] — log (a +1) 
log (q + 1) — log q — log (m + 1) 


It should be observed that the right member of the inequality is independent 
of %. 








1 See Walsh, op. cit., pp. 170-173. 
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To prove the inequality, we first remark that the maximum value of the 
function (ai, @2, +++ , @n) in the n-dimensional cube gn S a, S$ 1,k = 1,--- , n, 
is assumed at one of the “corners” of this cube. This may be readily estab- 
lished by using the symmetry of g and by examining the partial derivatives of 
the function g(a, %,+++,%e, Gky1, +++ ,@,) with respect to the a’s, where 
each of the z’s assumes either the value gy or 1. We leave the details to the 
reader. 

Suppose now that each a is an z, and that m of the 2’s are equal to qn and 
n — m of them are equal to 1. Then 


g(t, t2, +++» %n) = h(m) = (1 + 9) "!"(qn + 0)™" — (qn)™”. 


Let the range of variation of m be extended to include all the real numbers. 
Computing the derivative of h(m) by the methods of elementary calculus, we 
find at once that if log (1 + ») + log (qn + ») = 0, then h’(m) > 0 for all m, 
so in this case the maximum of A(m) in the interval 0 < m S n occurs when 
n=m. Since h(n) = 7, we obtain the first part of (3.1). On the other hand, 
if log (1 + ») + log (qn + 7) < 0, then there exists a unique root m = m of 
the equation h’(m) = 0 in the interval 0 < m < n, and m is readily found 
to be the maximum point of h(m). The rather artificial-looking expression in 
the second part of (3.1) is simply h(m), and « = m/n. 

It is apparent from the method of proof and from the order of magnitude of 
h(m) — h({m]) (where [m] denotes the largest integer in m) that the inequality 
cannot be essentially improved. The inequality is more convenient when stated 
in less explicit form. Since « is positive, the second expression in the third 
member of (3.1) is clearly dominated by [log 2 + log (1 + 1/q)]/| log 7 |. Thus 
we obtain the following corollary: 

Lemma 3.2. Let a1,+++,@n, , and q satisfy the conditions in Lemma 3.1. 
Then there exists a number Aq independent of n and n such that 


(3.2) II @ +n)” — I] ai” < = 
k=1 k=l | log m | 





It happens that the right member of (3.2) reflects accurately the order of 
magnitude of the right member of (3.1) for small values of 7, because 
lim,0 (qn)* = 1/e. 

We now apply these results to the fundamental polynomials w,(z). 

TuEorEM 3.1. Let z be a point of the complex z-plane such that z # ay”, 
k=1,.-.,n. Let D be a number such that |z — og” | S$ D,k =1,---, 2. 
If tis a point which satisfies the inequality |t — z| < 5, where 0 < 6 < D, and 
if q be chosen so that gs < |z — af” |, k = 1, +--+, n, then a number Ag,» exists 
which is independent of n, 2, t, and 6, such that 
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We add three remarks concerning these theorems. 

(a) It is not difficult to devise sequences M for which (2.1) obviously holds, 
If the points at”, k = 1, --- , n, are assumed to be distinct, we have the familiar 
formula 





Ln(eo;$) = 2 Jlas””) ——_#nb@0 


(0 — at” Jeon (os””) 


Thus a sufficient condition that the sequence M should satisfy (2.1) is that 





2 co 
“= » (n) ~ Xn, k= 1, Koy lim ai = 1. 
(2 — a” )wn(az”) rig 


(2.6) 


Sequences M which satisfy (2.6) for every point 2 ¢ C are easily shown to exist, 
and have been studied by a number of authors; an example of such a sequence M 
is given by the ‘Fekete points.’”” 

(b) If instead of being valid merely at a single point 2, (2.1) or (2.2) holds 
at each point z of a closed subset C’ of C containing no limit points of M, then 
the functions whose superior limit appears in (2.2) form (for n sufficiently large) 
a bounded equicontinuous family in the two variables z and t, z eC’, te K’. 
Accordingly, in this case the number p in (2.2) may be chosen so as to be inde- 
pendent of both z and t, z eC’, t e K’, and (2.5) holds uniformly for z ¢ C’. 

(ec) The proof of Theorem 2.1 does not apply to the case in which K is not 
connected, because then it may happen that the limit function W(é) used in the 
proof is a constant. In fact, both Theorems 2.1 and 2.2 are false in this case, 
as may be seen by letting C be a circular annulus whose outer boundary is the 
unit circle, and letting the points a4”, k = 1, ---,n be the n-th roots of unity. 


3. An inequality 


Lema 3.1. Let a1, a2,--+,@n be n real numbers such that qn S a S 1, 
k =1,.---,n, where0 <7 <1,0<q<1/y. Then 
q(a1, Q2,+++,@n) = I] (a, +n)” — [J ai” 
a k=l 
(3.1) uF on +2 1/@+)) 





« log gq — log (q+ 1) — log (n + 1) ' 
( joga + log @ +1) + log @+iy 71" q+), 


where 
< = 108 log an — log [log m + log (g + 1) + log (n + 1)] — log (a +1) 
log (g + 1) — log g — log (n + 1) 


It should be observed that the right member of the inequality is independent 
of %. 








See Walsh, op. cit., pp. 170-173. 
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To prove the inequality, we first remark that the maximum value of the 
function g(a: , @2 , +++ , @n) in the n-dimensional cube gn < a, S$ 1,k = 1, ---,n, 
is assumed at one of the “corners” of this cube. This may be readily estab- 
lished by using the symmetry of g and by examining the partial derivatives of 
the function g(t1, %,+++,%e, Qky1, +++ ,@n) With respect to the a’s, where 
each of the z’s assumes either the value gy or 1. We leave the details to the 
reader. 

Suppose now that each a is an z, and that m of the z’s are equal to qn and 
n — m of them are equal to 1. Then 


g(a, t2, +++, 2n) = h(m) = (1 + 9) ”"(qn + 2)” — (qn)™". 


Let the range of variation of m be extended to include all the real numbers. 
Computing the derivative of h(m) by the methods of elementary calculus, we 
find at once that if log (1 + ») + log (qn + ») = 0, then h’(m) > 0 for all m, 
so in this case the maximum of A(m) in the interval 0 < m S n occurs when 
n=m. Since h(n) = n, we obtain the first part of (3.1). On the other hand, 
if log (1 + ») + log (gn + ») < 0, then there exists a unique root m = m of 
the equation h’(m) = 0 in the interval 0 < m < n, and m is readily found 
to be the maximum point of h(m). The rather artificial-looking expression in 
the second part of (3.1) is simply h(m), and x = m/n. 

It is apparent from the method of proof and from the order of magnitude of 
h(m) — h({m]) (where [m] denotes the largest integer in m) that the inequality 
cannot be essentially improved. The inequality is more convenient when stated 
in less explicit form. Since « is positive, the second expression in the third 
member of (3.1) is clearly dominated by [log 2 + log (1 + 1/q)]/| log |. Thus 
we obtain the following corollary: 

Lemma 3.2. Let a,,-++,@n, , and q satisfy the conditions in Lemma 3.1. 
Then there exists a number Aq independent of n and n such that 


(3.2) Il (a: + 0)" — TL at" = 4e_. 
k=1 k=l | log 7| 





It happens that the right member of (3.2) reflects accurately the order of 
magnitude of the right member of (3.1) for small values of 1, because 
lim,+0 (qn)* = 1/e. 

We now apply these results to the fundamental polynomials w,(z). 

THEorEM 3.1. Let z be a point of the complex z-plane such that z # a”, 
k=1,...,n. Let D be a number such that |z — af” | S$ D,k =1,---,n. 
If tis a point which satisfies the inequality |t — z| < 5, where 0 < 6 < D, and 
if q be chosen so that gs S |z — af” |, k = 1,---, n, then a number Ayn exists 
which is independent of n, z, t, and 8, such that 


Aqp 


1/n 1/n 
| walt) |" S | waz) |" + [log 8|° 
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For the proof, let a, = |z — as” |/D, k = 1,---,n, and 7 = 8/D. The 
numbers a, , 7, and q satisfy the conditions of Lemma 3.1. Since | ¢ — a| < 
|z — a| + 6, we may write 


n 


| wa(t) ee ~ I] |t = af” - < I] (\z ane ax” | + 5)" 


=1 
= |w,(z) |"" + D | II (a, +)” — I ai". 


The conclusion of the theorem now follows at once from Lemma 3.2. 

THEOREM 3.2. Let C be an arbitrary closed point set of the extended plane (not 
the entire plane) with interior points, and let the derivative of the set M be a limited 
set contained in the boundary of C. Let {z;} be an infinite sequence of finite in- 
terior points of C approaching a finite boundary point B of C. Then given any 
e > 0, there exists a corresponding sequence {t;} of points of the complement of C 
approaching B, such that for all 7 > j(6), 


(3.3) | on(ti) |"" < fon(z) "+e n> n(j). 


For let D = bound | z; — of” |,j = 1,2,---,4 =1,2,---,n,n=1,2,---. 
Let 3e;/4 denote the distance from z; to the boundary of C. By choosing j 
sufficiently large, say 7 > j(€), we can make e; so small that (in the notation of 
Theorem 3.1) A1j2,p/| log e;| < ¢«. There exists for each j a point ¢; in the 
complement of C such that | t; — z;| < ¢;. Moreover, for n sufficiently large, 
say n > n(j), |z; — og” | = 4e;,k = 1,---,n. The conditions of Theorem 
3.1 with gq = 3, 6 = e¢;, are satisfied for each j > j(€), provided that n > n(j), 
and accordingly we obtain (3.3). 

In general, ¢; and z; cannot be interchanged in (3.3); that is, given a sequence 
{z;} in C approaching a boundary point of C, we cannot conclude that there 
exists a sequence {t;} in the complement of C such that for 7 > j(e), 


(3.4) I] on(z;) |" — Jon(t) "| Se n> n(j), 


where e > 0 is arbitrary. That this situation obtains can easily be proved by 
examples; it is attributable to the fact that an inequality such as (3.1), where 
the right member approaches zero with n, is not available if g = 0. However, 
if C is a closed Jordan region bounded by a suitably restricted curve (the require- 
ments fall considerably short of analyticity), then the equicontinuity property 
of the sequence {| w,(z) |/"} expressed by (3.4) can be established by means of 
Theorem 3.1. We shall not need this equicontinuity property in the investiga- 
tion of this sequence which follows, so we shall not discuss the matter further. 


4. Applications of the inequality 


Let C be a closed limited point set whose complement K is connected; further- 
more, let us assume that C has interior points and that all limit points of the 
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set M lie on the boundary B of C. Suppose now that (1.1) holds at an interior 
point z of C. There exists a maximal region Ro of C containing z ; the boundary 
of Ry is contained in B. If n is sufficiently large, the functions (log | w,(z) |)/n 
form a normal family of harmonic functions in any closed subregion of Ro. 
It is a simple consequence of Theorem 2.1 and 3.2 that each limit function of 
this family, or of the family | w,(z) |", is a constant, as we now shall show. 

Let W(z) be an arbitrary limit function of the family | w,(z) |"; W(z) is de- 
fined by an equation of the following type: 


log W(z) = lim eee, zeRo, 
where n’ runs through some infinite subsequence of the positive integers. If 
W(z) is not identically constant in Ry , then by the Principle of the Maximum 
for harmonic functions, there is an infinite sequence {z;} of points of Ry ap- 
proaching a point of B such that for some e > 0, W(z;) < W(a) — 2e,j = 
1,2,--.. But by Theorem 3.2, there exists a subscript J and a point t; in K 
such that 


| wn(ts) [/" S |on(er) "+6 n> n(J). 


Therefore 


Him |on(Qs) |M"" S Him | wy-(2s) |" + € = Wes) + < We) — 6; 


n'—0 n'—>o 
fim | wa(ts) |" < Tim | wnr(zo) |’. 
n'-0 n’—>o 


But the last inequality contradicts (2.2), so W(z) must be a constant. We are 
now in a position to state the following theorem: 

TaEorEM 4.1. Let C be a closed limited point set whose complement K is 
connected. Let all limit points of the set M lie on the boundary of C. Let x be 
some interior point of C, and let Ry be the maximal region of C containing 2. If 


lim | Lalzo;1/(t — 2)]|"" 51, te K, 


then 

(a) each limit function in Ro of the normal family (log | wn(z) |)/n is a constant; 

(b) for every function f(z) ¢€ Ac, Ln(z; f) — f(z) = O(r"), r < 1, uniformly on 
any closed point set interior to Ro . 

Part (b) of the theorem is a simple consequence of part (a) of the theorem, 
= of Theorems 2.1, 2.2, and Remark (b) of §1. We leave the proof to the 
reader, 


The theorem is false if limit points of M are allowed to lie inside Ro. For 
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example, if C is the set |z| < 1, andif av” = &""’",k =1,---, n, af” = 1 
k=n-+1,---, 2n, then 


’ 


‘ 1/2n l2-—4 lz) <1 
lim | wen(z) |" = 


n—r0 (jz| |2 -—4)}, |z| > 1; 
lim Len(z;f) = f(z), |2 —4| S$ 2,f@) €Ac; 


but the sequence {L»,(z; f)}, f(z) «Ac, may diverge if |z — $| > 3. 

The problem involved in the proof of part (a) of the theorem is typical of a 
class of problems arising in the study of interpolation by rational functions in 
the complex domain, in which certain properties of the sequence {| w,(z) |"’"} 
are to be derived from other properties of this sequence known for z both in- 
terior and exterior to a closed set C. In dealing with such problems, the in- 
equality (3.3) may provide a useful tool.” The writer intends to discuss this 
matter elsewhere in more detail. However, it seems not out of place to mention 
an application of the inequality to a line of investigation which is related to 
Theorem A, but which is concerned with interpolatory series of rational func- 
tions with poles at the points M. 

Recently Lammel” has derived necessary and sufficient conditions that a cer- 
tain expansion of the type just described should converge interior to a Jordan 
region R to any function f(z) «Az, where R is the eorresponding closed region. 
The points M are chosen on the boundary of R. In studying the necessity, his 
starting point essentially is the condition 

— |z—a| . |t-—a| 
me nine [e)[U* = ge [eau | 
which is to hold for all z eR and all ¢ in the complement K of R, where a is 
some point of R. From this inequality he infers, by a rather complicated argu- 
ment, that 


lt-—a| _ 


(4.2) lim =, 


noo | wn(t) [tn 
uniformly on any closed subset of K. But a line of reasoning similar to that 
used to establish Theorem 4.1a (the reader will have no trouble in supplying the 
details) will show that (4.2) is quite an immediate consequence of (4.1) and 
Theorem 3.2. Moreover in deriving (4.2) by this method, the restriction to a 
Jordan region becomes quite superfluous; indeed, R may be any region whose 
boundary is also the boundary of an infinite region.” 





11 Another application of the inequality similar to the one in this section appears in the 
proof o: Theorem 5.1b below. 

122 E. Lammel, loc. cit. 

13 Tt follows from this that Lammel’s Theorem 1 (loc. cit., p. 105) is true for such regions 
R. 
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5. The revised version of Theorems A and B 


We shall now prove Theorem A under the revised hypothesis (1.1), and also. 


Theorem B under this hypothesis, but with the necessary additional restriction 
on C that it be the closure of a region & whose boundary is also the boundary of 
an infinite region. Our method of approach is indicated in the following 
theorem: 

TurorEM 5.1. Let R be a limited region whose boundary B is the boundary 
of an infinite region K. Let all limit points of the set M lie on B. Necessary 
conditions that every limit function in R of the normal family (log | w,(z) |)/n 
should be a constant are that 


(a) lim | w,(2) |" = A, uniformly on any closed point set interior to R,. 


(5.1) where A is the transfinite diameter of R + B, 
(b) lim | wn(z) |" = A|¢y(z) |, uniformly on any closed limited point set 
(5.2) interior to K, where w = g(z) maps .K conformally onto the region 
|w|> 1 so that the points at infinity correspond to each other. 

If Risa Jordan region and if all the points M lie on B, then a further necessary 
condition is that the points M are uniformly-densely distributed on B. 

Put in another way, the theorem states that if the mean radii of the lem- 
niscates |w,(z) | = wn, nm = 1, 2,--- , passing through an arbitrary point z e R, 
approaches as » — © a limit which is constant for all z eR, then this limit 
must be the same as the limit of the mean radii of the minimal covering lem- 
niscates of R + B of respective degrees n.“ 

Let £,(z) = (log | wn(z) |)/n. Let {f,-(z)} be an arbitrary convergent subse- 
quence of the family ¢,(z). The convergence of this subsequence is uniform in 
some neighborhood N of a point z; e R, and the neighborhood is independent of 
the particular subsequence under consideration. Since the limit of the subse- 
quence is a constant, the two corresponding subsequences of partial derivatives 
with respect to x and y, 2 = x + iy, both converge uniformly to zero in N.” 
Therefore 


Do (e — af’) 
(5.3) lim + = 0, uniformly, zeN. 


7 
n'—>0o 





But the sequence {f,-(z)} was arbitrary, so we are able to conclude that every 
limit function of the normal family of analytic functions [D (i (2 — ag”) | /n 
is identically zero in N. Therefore (5.3) is still true if n’ is replaced by n. 


"I The latter limit is known to be A. See M. Fekete, Uber die Verteilung der Wurzeln 
bei gewissen algebraischen Gleichungen, etc., Math. Zeit., vol. 17 (1923), pp. 228-249. 


He F. Osgood, Lehrbuch der Funktionentheorie, fifth edition, Leipzig, 1928, vol. I, 
p. , 
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Differentiating and using the Weierstrass Double Series Theorem, we obtain the 
equations 


A, (a — af)” 
(5.4) lim R}| —= =0, »=1,2,3,-.., 


no: n 





where the numbers A, are any complex constants. 

Let z = ¥(w) be the inverse of the mapping function w = g(z). The function 
¥(w) is analytic and single-valued for |w| > 1 and may be represented by a 
Laurent series of the following type: 


z=y(w) =cw+ > Cw, c #0, |w| > 1, 
J 


where |c| = A.”° Thus 


dal ~ latwl4 


Therefore the function 1(z) = log | (2: — z)/e(z) |, when assigned the value 
log A at infinity, is harmonic for z e K; and if | g(z) | is defined to be unity on B, 
l(z) is continuous in K + B. Invert the plane in the point z, by means of the 
transformation z’ = 1/(z: — z), and let K’ be the image of K. The function 
L,(z’) = U(z, — 1/2’) is harmonic interior to the limited region K’ and continuous 
in the corresponding closed region K’. A theorem of Walsh” now states that 
given an arbitrary « > 0, there exists a harmonic polynomial P(z’) such that 
| Li(z’) — P(z')| < ¢/4, 2’ €K'. The polynomial P(z’) is an expression of the 
type Ao + >-7 R(A,z”), where Ao is a real constant and the A, are complex 
constants. In particular, we find that |1,(0) — P(0) | = |log A — Ao| < «/4. 
If we use this fact when we reverse the inversion, we obtain the inequality 








(5.5) lim 


zo 


U(z) — log A — ss R[A(a — 2)7] 








<3; zeK +B, 








or 
log |21 — 2] — log — FMA — 271] <§, 208. 
v=1 
Therefore 
ly noe ee ‘Soe ~ (n)\— € 
|2 > tog ja Ak | log - 2 | A, 3 (ei ol ) <9 
n= 1, 2, 





16 Fekete, loc. cit., pp. 237-240. 
17 J. L. Walsh, The approximation of harmonic functions by harmonic polynomials and 
by harmonic rational functions, Bull. Am. Math. Soc., vol. 35 (1929), pp. 499-544; p. 503. 
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Referring now to (5.4), we find that for all values of n sufficiently large, 
| (log | wn(21) |)/m — log A| < «. Thus all the constant limit functions 
of the family (log |wn(z)|/nm have the value log A, which means that 
lim, (log | wn(z) |)/n = log A,2 eR. Since the functions form a normal family 


in R, this limit is uniform on any closed point set interior to R. We have estab- 


lished part (a) of the theorem. 

A suitably chosen branch of the function [w,(z)]""/(A-¢(z)), when defined to 
have the value unity at infinity (in accordance with (5.5)), is analytic in any 
closed subregion of K if n is sufficiently large, and is uniformly bounded in 
modulus in K, the bound being independent of n. Thus the functions 
w,(2)}/"/(A¢(z)) form a normal family of analytic functions in any closed sub- 
region of K, if n is sufficiently large. Let 0(z) be a limit function of this family. 
Since |@(#) | = 1, it follows from the Principle of the Maximum that either 
|9(z) | = 1, or else there exists an infinite sequence {¢;} of points of K approaching 
a point of B such that lim;..,, | @(¢;) | < 1. But Theorem 3.2 and part (a) of 
the present theorem show that the latter case is impossible. For if such a 
sequence {t;} existed, then (since lim;_,,, A|(¢;) || @(¢;) | < A) there would exist 
a number e > 0 such that | w,-(t;) [/"’ < A — 2e, n’ = n'(j), j > j(€), where n’ 
runs through some infinite subsequence of the positive integers. However, by 
Theorem 3.2, there exists a subscript J > j(e) and a point z, e R such that 


| wn(2s) [Y™ S low(ts) PP" +e<A—2ete n'>n'(J). 


If we apply part (a) of the present theorem to the first member of this inequality, 
we obtain a contradiction at once. Thus every limit function of the normal 
family [w,(z)]’"/(Ag(z)) is a constant of modulus unity. This establishes part 
b of the theorem. 

The proof of the last part of the theorem is now a consequence of the work 
of Kalmér,”* who showed in his proof of Theorems A and B that if R is a Jordan 
region and if the points M lie on B, then equation (5.2) implies that the points 
M are uniformly-densely distributed on B. 

Combining Theorems 4.1 and 5.1, we may state our revised version of 
Theorems A and B very briefly as follows: 

TuroreM 5.2. Under the hypotheses of Theorem 5.1 concerning R and M, the 
conditions (a) and (b) of that theorem and the uniformly dense distribution of the 
points M are necessary conditions that at some point 2 ¢ R, 


(5.6) lim | Lalzo; 1/(t — a))[""S1, te. 


Of course these necessary conditions are also sufficient that (5.6) hold. In- 


Se MLR ES 9.2 Ts 


* Kalméar, loc. cit. See also Walsh, op. cit., pp. 168-170. 
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Am 
Vol 
deed, in the light of Theorem B, it is seen that (5.6) or the hypotheses of Theorem 
5.1 are sufficient conditions that 
lim L,(z;f) = f(z), zeR+B, S(2) € Anse, 
the convergence being not only uniform with respect to z, but ‘“‘maximal.’” 
The conclusions of Theorem 5.2 would be false if z were a boundary point of 
R, or if R + B were replaced by any more general closed set of C of which x is 
an interior point, even if the complement of C were simply connected. The 
reader will have no trouble verifying these statements by means of trivial 
examples similar to the one in §4. 
It is perhaps of interest to observe that an alternative method of proof for abl 
Theorem 5.1 is available which in no way depends on Kalmér’s work. The 
procedure consists in using a well-known necessary and sufficient condition of ‘ 
Weyl” that the points o(a{”) be uniformly-densely distributed on the unit ant 
circle, together with equations (5.4), to establish the last part of the theorem é 
first. Equations (5.1) and (5.2) for a Jordan region may then be obtained pty 
readily from the Gauss Mean Value Theorem. The validity of these equations 
can then be extended to the more general case by approximating the region R (1) 
by means of its Green’s level curves. 
CorNELL UNIVERSITY. 7 
19 Walsh, op. cit., p. 154. tion 


*® Weyl, loc. cit., pp. 313-316. 
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HILBERT DISTANCES AND POSITIVE DEFINITE FUNCTIONS 


By S. Bocuner 
(Received January 20, 1941) 


I. Hitpert Distances 


If S is any topological space then a continuous complex function of two vari- 
ables f(P, Q) on S X © shall belong to class $ (positive definite functions) if 


1) f(P, Q) = FQ, P) 

2) f(P, P) = fQ, Q) 
and 

3’) for any integer n, any points P;,---, P, and any complex numbers 
pfi,***, Pn the relation 


() 2. HPs, Piosvi & 0, 


147= 


holds. 

The wider class %» shall consist of all those functions for which, other assump- 
tions being unchanged, the decisive inequality (1) is assumed to hold only subject 
to the restriction 


nr 


(2) Da = 0. 


Just how much wider the class > actually is will be established under further 
assumptions. The real classes $8 and $ shall consist of all real-valued func- 
tions of these classes. 

We will call a function p(P, Q) a distance function if p(P, Q) = p(Q, P) 2 0, 
o(P, P) = Oand p(P, Q) + o(Q, R) = p(P, R). Wecall p(P, Q) a proper distance 
if P ¥ Qimplies p(P, Q) > 0. Introducing the real Hilbert space of sequences 
itn}, {Yn} with the customary distance (}-n (a, — yn) )'”, we now call p(P, Q) 
a Hilbert distance on © if it is possible to map © into in such a way that the 
value of p(P, Q) shall be equal to the value of the latter distance for the trans- 
forms of P and Q. The following decisive criterion has been established by K. 
Menger and I. J. Schoenberg: if S is separable, then p(P, Q) is a Hilbert distance 
if and only if —p(P, Q)? belongs to Bo. Also, Schoenberg discovered the fact. 
that a function f(P, Q) belongs to PB if and only if eY” ed belongs to $ for each 


——— 

See I. J. Schoenberg, Metric spaces and positive definite functions, Trans. Amer. Math. 
Soc. 44 (1938), 522-536. 

* Positive definite functions on non-compact but commutative groups have been analyzed 
recently in A. Powzner, Doklady U.8.S.R., 28 (1940) 294-295, and D. Raikow, Doklady 
USS.R., 27 (1940), 324-327 and 28 (1940), 296-300. 
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positive \, and he analyzed Hilbert distances on open euclidean and Hilbert 
spaces. 

In the present note we will investigate Hilbert distances for some types of 
general but compact separable spaces. We will obtain new relations between 
the classes $ and and structural criteria in terms of expansions in orthogonal 
systems.” 


Il. Compact Spaces witH BouNDED MEASURE 


Since f(P, Q) is continuous, if © is a finite interval, say, then condition 3’) for 
$ can be replaced by the following condition: 
3) for each integrable p(P) the relation 


(3) [ [ 1, @or(@ aP ag = 0 
holds; and for fp relation (3) holds subject to the restriction 
(4) / p(P) dP = 0. 


This replacement of sums by integrals is admissible for more general spaces. 

Lemma 1. Conditions 3) and 3’) are equivalent if a), S is both separable and 
compact, 8) there exists a Lebesque measure dP on © for which y) all Borel sets are 
measurable, 5) every open set has non-vanishing measure and e) the total space has 
measure 1. 

Proor: We first observe that 6 and S X © are each bi-compact, and that 
therefore every continuous function f(P, Q) is uniformly continuous and bounded. 
In the proof M will be independent of e. We will give the proof of the 
lemma for functions of $, for functions of $ it is even simpler. Let 
S(P, Q) satisfy 3’) and let p(P) be any function satisfying (4). Given « > 0 
we partition © into a finite number of Borel sets G; , --- , SG» such that f(P, Q) 
oscillates by less than ¢ on each ©; X G;. Choosing an arbitrary point P; 


in ©; and putting p; = [ p(P) dP we obviously have (2) and 
Si 


(5) X 1P:, Pedi [ [ $P, @o(P)o(@ aP dQ] < ei. 


t,j=1 

Letting « — 0 we obtain (3). Conversely, if f(P, Q) satisfies 3), and distinct 
points {P;} and numbers {p;} are given, and if (2) holds, then for « > 0, we pick 
a set of disjoint neighborhoods ©; of P; , such that f(P, Q) oscillates by less than 


eon each ©; X G;. Putting p(P) = ——? — for PeG,, and p(P) = 9 
measure ©; 


for PeS — (©, + --- + G,), we have relation (2) and (5), and, by a limit, 
relation (1). 








? Whenever the range of integration is not indicated it is the total set to which the 
variable refers. 
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In what follows we will always assume that © satisfies the assumptions of 


the lemma. 
TazoreM 1. If f(P, Q) belongs to $, if o(P) is continuous and c is a real 


constant, then 
(6) a(P, Q) = (P,Q) + oP) + o(Q) — 


belongs to Bo except for property 2). 
Conversely, if g(P, Q) belongs to Po, then there exists a continuous function 
y(P) and a real constant c such that 


(?) f(P, Q = GP, ® — oP) - oQ +e 
belongs to 8, except for property 2), as for instance 

8) oP) = | (P,Q) a0 

(9) c= / g(P) dP = / / g(P, Q) dP dQ. 


Proor: The first part follows from the fact that (4) implies 


ay —- [ [ ere aP ae = | o(P)o(P) a. [ 9 ae = 0; 


the second part follows from the fact that (7), (8), (9) and A = | p(P) dP 
implies 


[[12, @oPro@ ap ag = | f oP, DOP) — NOD — War ao. 


We call a function f(P, Q) measure invariant if / S(P, Q) dQ is independent 


of P. This leads to 
: TurorEM 2. A function g(P, Q) belongs to the measure invariant class Po , 
if and only if it can be written in the form 


(11) f(P, Q) — f(®, ®) 


where f(P, Q) belongs to the measure invariant class % and © is any point of S. 

We now assume the existence of a fixed transitive group © of continuous 
transformations of © into itself, and we assume that our measure is invariant 
under the operations of ©. We call a function f(P, Q) group invariant if for 
all elements s of €, f(sP, sQ) = f(P, Q), where sP is the transform of the point 
P under the transformation s. In particular f(sP, sP) = f(P, P). Now if P 
and Q are any two points, since our group is transitive there exists an element 
ssuch that Q = sP. Thus from now onward property 2) of $ will be automati- 
cally fulfilled for any function f(P, Q) which is group invariant. In order to 
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emphasize the independence of f(P, P) of the special point P we will designate 
it by f(®, &) where @ is any fixed point which is chosen appropriately, For 
instance if © itself is a.group it is customary to take for & the identity element 
of the group. 

If f(P, Q) is group invariant we also have 


o(P) = [ sP, a0 = | ser, 9Q)4Q = | f(eP, 20) a(00) 


= | KP, Q)a0 = o(sP), 


and this implies that y(P) is a constant. In other words, if f(P, Q) is group 
invariant it is also measure invariant. This leads to— 

TuEorEM 3. A function g(P, Q) belongs to the group invariant class Bo if and 
only if it can be written in the form (11) where f(P, Q) is group invariant and 
belongs to . 

A function p(P, Q) is a group invariant Hilbert distance if and only if it can be 
represented in the form 





where f(P, Q) is group invariant and belongs to the real class f. The function 
e(P, Q) is a proper distance if and only if f(P, Q) < f(®, ®) for P ¥ Q. 


III. Group InvARIANT FUNCTIONS 


In the present section we will supplement Theorem 3 by statements involving 
expansions in orthonormal systems. An orthonormal system {¢g,(P)} is of 
course defined by the property 


/ m(P)en(P) dP = bmn; 


the orthonormal system we will consider will automatically consist of continuous 
functions and it will be complete both in the space of continuous functions and 
in the space of integrable functions. 

The simplest space © is the torus 0 S x < 2x with the group of translations 
on it. Any continuous function f(z, y) on G X GS has a Fourier expansion 


(12) > Gane 
Group invariance means that f(x, y) = f(x + s, y + s) and this implies 
Qnn = Gut 


Thus, dm, = 0 if m # n, and (12) has the form 


(13) Y ane™e™, 
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. Now f(z, y) belongs to $ if and only if a,, is real and = 0, and }\ a, < ; 
or f(c, y) is real if and only if a_» = a@_. Thus, by Theorem 3, p(z, y) is a Hilbert 
at distance if and only if it is the square root of an expression 


(14) > On(2 — gine _ g-ilev)) 9 Gn(e™ — e™*)(e-™ — oi) 


m=1 


where 
(15) Om 20 and Yan < &. 


If we prefer to look upon p(z, y) as a function of the one variable z — y it is 
more appropriate to write (15) in the form 


p 
(] i) 2 
d (16) 2 3 a(1 — 08 m(x — y)) = 4 3 am (sin ™(e — 9). 
4 This result can be extended to our spaces in general in terms of expansions 
; into “generalized spherical harmonics” as given by E. Cartan and H. Weyl.‘ 
: If S is a space satisfying lemma 1 with a transitive group € of motions then 
there exists on S a complete orthonormal system of continuous functions of the 
following description : 
a (i) corresponding to each k,k = 0, +1, +2, +3, --- there exist a finite rectan- 
gular system of functions gé,m)(P) with 
m=1,---,11 = lk); 
( 17) ’ ’ 
1g w=1,---,h,h = hk), 
of and a quadratic system of functions Uz,ma(P) on € with m,a=1,---,l = 
l(k), such that } 
l 
(18) Pk,myu(SP) = > Uk,ma(8) Pk,au(P). 
Us (i) For each k, wi,ma(s) is an irreducible unitary representation of €, and for 
id different values of k the representations are inequivalent. 


(ii) fork = 0,1 = h = 1, and go(P) = 1, w,u(s) = 1; also 1(—k) = l(k), 
ns h(—k) = h(k), and 


¢-2alP) = GaaP), ” usanel@) = tamel?)- 


Now the system of functions 


Pp.mu(P ) * Go.nr(Q) 
is an orthonormal system on © X ©, and thus f(P, Q) has an expansion 
(19) py Sng nines Ooi FP )a.nr(Q). 
P»M pnw 


le 


‘H. Weyl, Harmonics on homogeneous manifolds, Annals of Math., 35 (1934), 486-494. 


ponent “* ae haha altars hte. 
7° ‘peters 
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Replacing P and Q by sP and sQ respectively and using (18) we obtain as a 
necessary and sufficient condition for group invariance the system of relations 
l(p) l(a) aS EDS H4 
Apg,ap,pv = Bpesnn.ar Hemet Menthe) 


m=1 n=l 


and this is equivalent with 
1(q) i(p) 


(20) > Apq,ap,nv Ug,pn(8) = p> pq, my.6r Up,ma(8). 


Since the functions {u(s)} are linearly independent the comparison of coefficients 
on both sides of (20) will show that @y¢,my,n, Vanishes if p ¥ q or if m ¥ n, and 
that for p = q and m = nits value is independent of m. This leads to writing 
(19) in the form 


C) h(k) 
(21) (Ear pnnl, ©) 
where 
1(k) 
(22) Pk,uv(P ’ Q) = x Pk, mu(P er, Pr,m»(Q). 


We can now express the properties of § in terms of the coefficients a;,,,. Prop- 
erty 1) of $ simply means that for each k, the matrix 


(23) | Qk u> | pirmd,--- ACR 


is hermitian. Property 2) is automatically fulfilled, and property 3) means that 
(23) is non-negative definite. In fact putting in (3) 


p(P) = y Lp Pk, mu(P) 


we obtain 

(24) kyr Ly = Oz 
Conversely if p(P) is a finite sail of the form 
(25) 2X Ap Po.mu(P) 


the left side of (3) is 





E (2, 029mm) 


he v= 


and this is = 0 if (24) holds. But finite sums of the form (25) are dense in the 
family of all function p(P) and thus (24) implies property 3). Finally by 
property (iii) of {g(P)}, f(P, Q) is real if and only if 


—ay 


(26) Qk = A-k,pr- 
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Thus a function f(P, Q) of the real class $ has an expansion of the form 


C h(k) 
(27) ao + >> (= Or w(Prw(P, Q) + Pk w(P, ®) 
k=1 y=] 


where each matrix (23) is a semi-definite hermitian matrix. Before proceeding 


we require the following 

Lemma 2. There exists an array of real numbers r,x,n = 1, 2,---, k = 0, 
+1,+2,--- , with the following properties: (i) 0 S rp,x S 1, (ii) lim,sorne = 1, 
(iii) for each n only a finite number of coefficients 7, is * 0, and what is decisive, 
(iv) if (21) is the expansion of a continuous function f(P, Q), then the sequence of 
functions 


J h 
08) JulP, Q) = SX roa( Z drsvernlP, @)) 


is uniformly convergent towards f(P, Q) asn > ~. 

The proof of the lemma can be carried out along familiar lines and will be 
omitted.” 

Now, if (23) is hermitian the number 


h 
a = > Ok, y» Pk,ur(P, ®) 
py=1 
is 2 0 and 


h 
» Se Ok,ur Pk,w(P, Q) 4 Ak 5 
Tt ed | 


therefore 
0 <2 Tnkrdk S fal, ®) S M 


where M is independent of n. Letting n — © we obtain Dx < M, and we 
hence conclude that the series (21) and (27) are absolutely and uniformly con- 
vergent. Therefore we have for 


f(®, ®) — f(P, Q) = 3f(P, P) + 3f(Q, Q) — HP, Q 


the series 
eo h(k) 
(29) > (2 Ory» Wieuo(P ’ @) 
k=1 y=1 
where 





l(k 
(0) Yase(P, Q) = 2 (onm(P) — enon Q)) Onme(P) ~ erme(Q)) 


*See S. Bochner and J. v. Neumann, Almost periodic functions in groups, Trans. Amer. 
Math. Soc., 37 (1935), 21-50, esp. Part III; H. Wey], loc. cit., p. 498-499. 
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Altogether, on the basis of Theorem 3 the following theorem can now be easily ti 

verified. G, 
TureorEM 4. A non-negative function p(P, Q) ts a group invariant Hilbert 

distance if and only if its square ts an absolutely and untformly convergent series eC 

of the form (29) in which such matrix (23) 1s non-negative hermitian and yy, has W 


the value (30). 
We observe that our series (29) is a generalization of series (14). 
TueoreM 5. The Hilbert distance of Theorem 4 is certainly a proper distance 


if each matrix (23) is strictly positive definite. W 
Proor: In fact, if (23) is strictly positive then p(P, Q) can only vanish if for 

each k, m, a ce 

f 

gr.ma(P) — grma(Q) = 0. y 

But the latter fact implies that every continuous function on © assumes equal ht 


values on the points P and Q, and this can only happen if P and Q are identical. 


IV. Some Spreciric CAsEs 


Formulas (27) and (29) are greatly simplified if for all values of k we have W 
h(k) = 1. In this case we can write 





(P,Q) = a + ¥ asler(P, ® + oP, @) 

















where ° 
catenins an (3 
(31) gx(P, Q) + ox(P, Q) = p> (x,m(P)e%m(Q) + geym(P)er,m(Q)) ; 
also the expression : 
(32) p> ax(2yr(, &) — o(P, Q) — ox(P, Q)) fu 
fu 
for p(P, Q)” shows a strong resemblance to the left side of (16). The resemblance al 
is most pronounced if © is the (m — 1)-dimensional unit sphere in m-dimensional 
Euclidean space. In this case (31) is but for a numerical factor depending on 
k (and m) the expression 
T 
Tt” (cos 9) 
() m 





where 7," is an ultraspherical polynomial, \ = m — 2/2, and # is the geodesic 
distance between the points P,Q. The expression 





> b, TE” (cos 8) 





with bk = 0 and >), b.7{(1) < @, for the most general positive definite func- 





HILBERT DISTANCES AND POSITIVE DEFINITE FUNCTIONS 655 


tion depending on the geodesic distance alone, has been given previously by 


Schoenberg.” 
More special than the assumption h(k) = 1 is the assumption l(k) = 1. It 


corresponds to the case of the group € being an Abelian group, and in this case 
we simply have 





f(P, Q) = - ax(ex(P)ol@ + oi(P)or(Q)) 


with a, 2 0. 
The symbol sP is a real multiplication of P by s if S is a group space (in which 


case we write x, y, ete. instead of P, Q, etc.) and C is its group of left transforma- 
tions c — sz. Our assumptions concerning © are now that © is a compact 
separable group, the invariant measure on it being now uniquely determined. 
In this case, h(k) = U(k) and but for the factor U(k)"”, gi(x) is Uep(x). Since 


l adh: OR NS l 
p> Uk, mu (L) Uk, mv(Y) = x Uxym(Y)tLi, p(X) 


we see that, except for the factor 1(k), 
Pkur(Z, Y) = Urn (y” 2), 


and hence we obtain the following result. 
TororEM 6. If € is a compact separable group, then f(x, y) is a left invariant 
member of 8 if and only if f(x, y) = f(y™'x) where 


00 1(k) 


(33) f® = 2») > Di.ur Uryur(t) 3 


the matrix | Diy» | being non-negative hermitian and the series (33) in k being 
absolutely convergent. 

In order to obtain right invariant functions we have to put ¢ = yx. Our 
function is invariant on both sides if f(s ‘y~‘xs) = f(y” 'z) that is if f(é) is a class 
function. For class functions the expansion (33) depends only on the group 
characters 


1 
xe(t) = > Uru(t).” 


Thus we obtain 
TaroreM 7. If © is a compact separable group, then f(x, y) is a group invariant 
member of $3, if and only if 


f(z, y) = = bexe(y2) 


‘1. J. Schoenberg, On positive definite functions on spheres. Bull. Amer. Math. Soc., 46 
(1940), p. $88, 
"See Bochner-von Neumann, loc. cit. 
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where by = 0, and >>y bixx(1) < ~; and p(x, y) is a group invariant Hilbert dis- 


tance if and only if 


rte, ot = a(1 — Oe Emer) 1 


with c, = 0; the function p(x, y) is certainly a proper distance if all c, are > 0, 





V. Isometric IMBEDDING OF RIEMANNIAN SPACE INTO HILBERT Space 


In the present section we will draw a conclusion from Theorem 5. We assume 
that our space © is a coordinate space of class C,, r = 2, (continuous partial 
derivatives of order < r), or C,, (derivatives of every order) or C, (analytic 





coordinates), and we further assume explicitly that the functions {¢;,,(P)} 
belong to the same class. By choosing the coefficients of the matrix (23) suffi- 
ciently small we can obtain a series (29) whose sum will belong to the same 
class on SG X S. Now, by a general theorem,’ such a distance can be generated 
by a Riemannian metric. Altogether we have the following 

THEeorEM 8. If © is a compact differentiable manifold (of arbitrary dimension 
and) of class C,, r = 2, or C,,, or Ca, of © is a fixed transitive group of homo- 
morphisms on it, and if the corresponding generalized spherical harmonics {| gx,my(P)} 
belong to the same class, then there exists on S a group invariant positive definite 
Riemannian metric of class C,-2, or C,,, or Cx for which the gwen space can be 
isometrically imbedded in real Hilbert space. 


po & & 


.——n 
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88. Bochner, Differentiable and Riemann metric, Duke Math. Jour., 4 (1938), 51-54. 
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UNITARINVARIANTE HYPERMAXIMALE NORMALE OPERATOREN 


Von Hipecoré Nakano 
(Received November 30, 1939) 


Unitérinvariante hypermaximale Hermitesche Operatoren im Hilbertschen 

Raum sind schon in mehreren Formen gegeben.’ Ich will in dieser Arbeit Uni- 

tirmvariante hypermaximale, normale Operatoren im Hilbertschen Raum 
in einer anderen Form geben. 

Massoperatoren E(Z) eines hypermaximalen, normalen Operators bilden einen 
abgesehlossenen, Abelschen Ring # von Projektionsoperatoren, dessen Defini- 
tion in der friheren Abhandlung’ gegeben ist. Zuerst definieren wir die Dimen- 
sion eines abgeschlossenen, Abelschen Rings. Fiir einen hypermaximalen, nor- 
malen Operator N kann man die komplexe Ebene G in eine Folge von Mengen Z, , 
Z,,Z,, ---derart zerlegen, dass je zweider Z,, , Z:, Z2, -- - keinen gemeinsamen 
Punkt besitzen, und der aus allen Projektionsoperatoren E(Z) (Z C Z,) gebildete, 
abgeschlossene, Abelsche Ring ®, von der Dimension n ist. Diese Zerlegung 
ist bis auf eine Nullmenge iiber E(Z) eindeutig bestimmt. Diese Zerlegung Z,, , 
Z,, Z,,--+ nennen wir das Spektralsystem des hypermaximalen, normalen 
Operators NV. Dafiir, dass zwei hypermaxithale, normale Operatoren N; und Ne 
unitér aquivalent seien, ist es notwendig und hinreichend, dass die Klassen der 
Nullmengen und die Spektralsysteme von Ni und Ne. miteinander tiberein- 
stimmen. 


1, Dimension von Ring. Abgeschlossene Abelsche Ringe nennen wir kurz 
Ringe. SR sei ein Ring, d.h. je zwet Projektionsoperatoren P und Q in R sind 
vertauschbar, P — PQ gehért zu R, und fiir jede Folge P, , P2, --- in R gehdren 
Pit P, 4+... und PyP2 --- beide zu R. 

Derinition 1. Ein Projektionsoperator K heisst mit einem Ring § ver- 
tauschbar, wenn jedes P in mit K vertauschbar ist. 

Derinition 2. Wenn ein Projektionsoperator K mit einem Ring § ver- 
tauschbar ist, so bilden KP (P ¢ ®) auch einen Ring, welchen wir den Nebenring 
von § nennen. 

Da § separabel ist, kann man leicht beweisen, dass die Summe 2) {P} auch 


am R gehdrt.? Diese Summe nennen wir den Mazximaloperator des Rings %. 


‘Vgl. M. H. Stone, Linear Transformations in Hilbert Space, New York (1932). F. 
Wecken, Unitdérinvarianten selbstadjungierter Operatoren, Math. Annalen, 116 (1939). 

*H. Nakano, Uber Abelsche Ringe von Projektionsoperatoren, Proc. of the Phys.-Math. 
Soc. of Japan, 21 (1939). 

*fi, f2, +++ sei tberall dicht in ©, und Pai, Paz, --+ sei eine Folge von Projektions- 
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Indem man den Maximaloperator von t mit M bezeichnet, kann man leicht 
einsehen, dass zwei Nebenringe K® und L&R dann und nur dann tibereinstimmen, 
wenn KM = LM ist, da der Ring K® offenbar mit dem Ring KM¥ iiber- 
einstimmt. 

DerFIniITION 3. Zwei Nebenringe KR und L&R heissen isomorph, wenn fir 
jeden Projektionsoperator P in R KP = 0 mit LP = 0 gleiche Bedeutung hat. 

Derinition 4. Zwei Nebenringe KR und L&R heissen zueinander orthogonal, 
wenn fir je zwei Projektionsoperatoren P und Q in ® stets (KP)(LQ) = 0 ist. 

Man kann leicht einsehen, dass zwei Nebenringe KR und L® dann und nur 
dann zueinander orthogonal, sind wenn (LM)(KM) = 0 ist. 

Derinition 5. Wenn n Nebenringe Ki, KoW, --- , K,R eines Rings § alle 
mit ® isomorph, und je zwei von Ki, K2R, --- , K,R zueinander orthogonal 
sind, und noch n + 1 solche Nebenringe nicht existieren, so heisst n die Minimal- 
dimension von Rt. Wenn beliebig viele solche Nebenringe existieren, so heisst 
von der Dimension ~. 

DEFINITION 6. Rseiein Ring. Wenn fiir jedes P in R die Minimaldimension 
des Nebenrings P® stets n ist, so heisst R gleichmédssig dimensional mit der 
Dimension 7. 

Aus den Definitionen folgt sofort der folgende 

Satz 1. Wenn die Minimaldimension eines Rings R grésser als n ist, so ist die 
Minimaldimension des Nebenrings PR fiir jedes P in R auch grésser als n. 

Satz 2. Wenn die Minimaldimensionen von endlich oder abzdéhlbar unendlich 
vielen Nebenringen Li, LeoR, --- alle grésser als n sind, und L;L; = 0 (i ¥ j) 
gilt, so ist die Minimaldimension des Nebenrings (Li + Le + --- ) R auch grosser 
als n. 

Bewets: Nach Voraussetzung gibt es n + 1 zueinander orthogonale Neben- 
ringe K{PLR, KPLM, --- , Ki"? LM, welche alle mit L&W (¢ = 1, 2, ---) 
isomorph sind. Dann sind n + 1 Nebenringe (L,K{” RO mt ces JR, +--+, 
(L:K{"*? + L,K§"*” + ...)9 offenbar zueinander orthogonal, und sie sind so- 
gar alle mit (Li: + L2 + --- )§ isomorph, denn, aus (L;K{? + LeK$” + ---)P = 
0(P €®) folgt L:K{”P = 0, LoKi”P = 0, --- , und weiter L:P = 0, L2P =0,--- ; 
folglich (Li + Lz + --- )P = 0, und umgekehrt folgt aus (L; + Le + --- )P =0 
(P €®) offenbar (L:K{? + L.K§? +... )P = 0. Daher ist die Minimaldimen- 
sion des Rings (Li + Le + --- )® grésser als n. 

Satz 3. Wenn die Minimaldimensionen von endlich oder abzéhlbar wnendlich 
vielen Nebenringen Lif, Le®, --- grésser als n sind, und je zwei von Ly, Le, --: 





operatoren im Ring §, derart dass 
lim || Patfn || = lu.b. || Rfn || 
imo ReR 


ist. Setzt man P, = i Py, soist Pi S P. S --- , und alle P, gehéren zu XR. Folglich 
i,jsn 
gehort lim P, = M auch zu &, und es gilt || Mf, || = l.u.b. || Rfn || (n = 1,2, -*: )- Daher 
no ReR 
ist M der Maximaloperator in ®. 
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(Li + Le + «++ )®R grésser als n. 
Bewels: Setzt man 


L, = La — La(Li + Le + --- + Lara), 
so gilt 
L+Ib+---=-h+h+---, 


und nach dem Satz 1 ist die Minimaldimension von L, auch grésser als n. 
Daher folgt dieser Satz sofort aus dem Satz 3. 

Satz 4. Wenn ein Ring R von der Minimaldimension n ist, so gibt es zwei 
Projektionsoperatoren P und Q in ®R, derart dass PQ = 0, der Nebenring PR 
gleichmissig dimensional mit den Dimension n, die Minimaldimension des 
Nebenrings QR grésser als n ist, und P + Q mit dem Mazximaloperator M von R 
tibereinstimmt. 

Bewels: Wenn § gleichmissig dimensional mit der Dimension n ist, so 
braucht man nur P = M zu setzen. Im anderen Falle bilden alle Projektions- 
operatoren F in Rt, fiir welche die Minimaldimension des Nebenrings RR grésser 
als n ist, nach den Saétzen 1 und 3 einen Unterring ®: von ®. Bezeichnet man 
den Maximaloperator von ®t, mit Q, so gehért Q zu KR, , und die Minimaldimen- 
sion des Nebenrings Q& ist grésser als n, und fiir P = M — Q ist der Nebenring 
PR offenbar gleichmassig dimensional mit der Dimension n. 


2. Unitérinvarianz. Es sei N ein hypermaximaler, normaler Operator mit 
dem Massoperator E(Z). Wenn man mit ® den aus allen #(Z) gebildeten Ring 
bezeichnet, so kann man nach dem Satz 4 die komplexe Ebene G in abziahlbar 
unendlich viele, messbare Punktmengen Z,,, Z:, Z2,--- derart zerlegen, 
dass der Nebenring E(Z,)R gleichmassig dimensional mit der Dimension 
(= 0, 1,---) ist ndG@ = Z,+44+ 24+---, 2:2; = 0 (i $j) gilt. 
Diese Zerlegung Z,,, Z:1, Z2,~--- der komplexen Ebene G nennen wir das 
Spektralsystem des hypermaximalen, normalen Operators N. Man kann leicht 
einsehen, dass das Spektralsystem bis auf eine Nullmenge tiber H(Z) eindeutig 
bestimmt ist. 

Satz 5. Dafiir, dass zwei hypermaximale normale Operatoren N, und Ne unitdr 
dquivalent seien, d.h. dafiir, dass fiir einen passenden unitéren Operator U N; = 
U*N3U bestehe, ist es notwendig und hinreichend, dass die Klassen der Nullmengen 
und die Spektralsysteme von N, und Nz bis auf eine Nullmenge tibereinstimmen. 

Mit F,(Z) und E:(Z) bezeichnen wir den Massoperator baw. von N, und 
N:. Wenn fir einen unitaren Operator U N, = U*N,U ist, so erhalt man nach 
der Eigenwertdarstellung 


[ zd(E(Z)f, 9) = [ 2d(U*E,(Z)Uf, 9) 


miteinander vertauschbar sind, so ist die Minimaldimension des Nebenrings 
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fiir jedes f im Definitionsbereich von N; und jedes g in §, folglich nach der 


Eindeutigkeit des Massoperators ei 

E,(Z) = U*E,(Z)U. 7 

} 

Daher kann man leicht einsehen, dass die Klassen der Nullmengen iiber E,(Z) he 
und £,(Z) und die Spektralsysteme von N; und Nz bis auf eine Nullmenge 

ubereinstimmen. di 

Fiir den Beweis der Hinreichendheit brauchen wir folgende Hilfssitze. n 


Hitrssatz 1. sei ein Ring mit dem Maximaloperator M. Es gibt mindestens 
ein Element f in © derart, dass fiir jedes R(¥ 0) in R stets Rf ¥ O ist. Ein solches 


Element f nennen wir ein eigentliches Element in bezug auf §®. . 
Bewels: Da § separabel ist, gibt es eine Folge von Elementen f, , fo, .-. r 
derart, dass © von fi, fo, --- aufgespannt wird, und || fi ||’ + || fo ||? + --- kon- 
vergiert. Alle Projektionsoperatoren FR in ®, fir welche Rf, = 0 ist, bilden F; 
zusammen einen Unterring ®, von . Wenn man den Maximaloperator von P 
®, mit H,, bezeichnet, so kann man leicht einsehen, dass ®,, mit dem Nebenring 
E,W iibereinstimmt. Daher ist (MVM — E,)Rf, ¥ 0 fiir jedes R in R wenn fi 
(M — E,)R # Oist. Setzt man FE, = E,E, --- E,, so konvergiert : 
(M — Ei)fi + (Ei — Es)fe + (Ea — Ea)fs + ++ V 
nach einem Element fo, da || (Ex. — Ex)fs |’ S || fs ||? ist, und || f, |? + gh 
|| fo ||? + --- konvergiert. Dieses Element fo ist ein eigentliches Element in " 
bezug auf §t, denn, wenn fir einen R in R Rf = 0 gilt, dann ist zugleich k 
(M — Ei\)Rf,=0, (Ei — E:)Rfo=0, (E2 — E3)Rf = 0,-:-, m 
folglich muss al 
(M — E;)R=0, (M — E,)(Ei — E:)R=0, (M — E;)(Ey — E;)R =0,--- m 
: Ki 
sein. Da te 
(M — E,)(Ex-1 — E),) = (M — E,)(By «++ Ena — Ey +++ En1En) Ww 
= F,--- Eya — Ej -++ Beak, = Ena - E, : 
ist, erhalt man a 
(M—E,)R=0, dh MR=E,R P 


folglich 
MR = Riim E£,,. 


n->o 


—, £ 


Andererseits gilt (lim E,)f; = 0 (i = 1, 2,---), und fi, fe, fs, «++ spannen § be 
auf. Daher muss lim E;, = 0 sein, folglich R = MR = R(lim E,) = 0. R 
Derinition 7. Zwei in bezug auf ® eigentliche Elemente f und g heissen (@ 


eigentlich orthgonal, wenn fir jedes R in R(Rf, g) = (f, Rg) = 0 gilt. 
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DeriniTion 8. & sei ein Ring mit dem Maximaloperator M. Wenn fir n 
eigentliche Elemente fi , fa, +++ 5 fn (n = ©,1,2,---), von denen je zwei zuein- 
ander eigentlich orthogonal sind, die abgeschlossene lineare Mannigfaltigkeit des 
Maximaloperators aus allen {Rf,, Rf2,---, Rfn} (R «§R) aufgespannt wird, so 
heist f:, fo, -*: » fn ein Basensystem des Rings ®. 

Hitrssatz 2. Wenn ein Ring R mit dem Mazimaloperator M gleichmdssig 
dimensional mit endlicher Dimension n ist, so besitzt R ein Basensystem, das aus 
neigentlichen Elementen besteht. 


Bewets: Da die Dimension von § n ist, so gibt es zueinander orthogonale, 
mit 9 isomorphe Nebenringe AiR, KeR, ---, K,9 mit den Maximalopera- 
toren K,, K2,---, Kn. Wenn man mit f; ein eigentliches Element in bezug auf 
K& bezeichnet, so sind je zwei der Kifi, Kefe,---, K,f, offenbar eigentlich 
orthogonal. und K,f; ist ein eigentliches Element in bezug auf Kt. Bezeichnet 
man mit K; den Projektionsoperator der von allen {RK;f;}(R ¢ R) aufgespannten, 
abgeschlossenen, linearen Mannigfaltigkeit, so ist offenbar M = Ki + K. + 
... + K,.Wenn M — (Ki + Ke + --- + K,) # 0 ist, so bilden alle Projek- 
tionsoperatoren R in R, fir die (M — (Ki + K.+--- + K,)]R = Oist. einen 
Ring mit dem Maximaloperator Ry, und es ist M — Ryo # 0. Dann ist die 
Minimaldimension von (M — Ro)® grésser als n, was unmdglich ist, da R 
gleichmiissig dimensional mit der Dimension n ist. Daher ist M = Ki + K2 + 
+ K, dh. fi, fo, «++, fn ist ein Basensystem von §. 

Hitrssatz 3. Wenn ein Ring KR mit dem Mazximaloperator M ein aus n 
Elementen bestehendes Basensystem fi , fo, --+ , fn besitzt, so ist R von der Mini- 
maldimension n. 

Beweis: Wenn man mit Mt die dem Projektionsoperator Yt entsprechende, 
abgeschlossene, lineare Mannigfaltigkeit bezeichnet, so gibt es einen hyper- 
maximalen, normalen Operator N in 2, dessen Eigenring mit § iibereinstimmt.* 
Fir jedes fo in IN lasst sich jedes Element in der aus allen Rfy(R « R) aufgespann- 
ten, abgeschlossenen, linearen Mannigfaltikeit in der Form (N)fy darstellen, 
wo (NV) eine Funktion von N, und ®(N)fo sinnvoll ist. Da Mfi, Mfr, ---, 
Mf, auch ein Basensystem von § ist, so kann man annehmen, dass fi , fo ,.--- , 
f, alle zu 2 gehéren, und wir betrachten daher nur den Raum M. Wenn die 

‘Satz 8 in der Abhandlung: H. Nakano, Uber Abelsche Ringe von Projektionsoperatoren, 
Proc. of the Phys.-Math. Soc. of Japan 21 (1939). 


*Vgl. M. H. Stone, Linear Transformations in Hilbert Space, New York (1932). Man kann 
es auch wie folgt einsehen: Alle iber E(Z) messbare Funktionen #(z), fiir die das Integral 


| (z) |? d || E(Z)fo ||? konvergiert, bilden nach dem Riesz-Fisherschen Satz einen Hil- 
G 





bertschen Raum mit dem inneren Produkt / ©, (z)#:(z) d || E(Z)fo |\?. Dieser Hilbertsche 
G 
Raum ist mit der aus allen &(N)fo bestehenden, linearen Mannigfaltigkeit isometrisch, da 


((N)fo, &2(N)fo) = / $1(z)&2(z) d || E(Z)fo ||? ist. Daher bilden #(N)fo eine abge- 
@ 


schlossene lineare Mannigfaltigkeit, die offenbar von allen E(Z)fo aufgespannt wird. 
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Minimaldimension von § grésser als n ist, so kann man wie im obigen Beweis 
leicht einsehen, dass zueinander eigentlich orthogonale meer P1, G2, ++ 
gm (m > n) existieren. Da Mt von allen {Rfi, Rf, ---, Rfn}(R eR) aufges- 
pannt wird, so kann man schreiben: 


qi = Ou fi + Sg + Pinfn 
en le geeks ease «6h enee i aeame (m > n) 
Pm = Omifi tees + PinnJn 


wo %;; passende Funktionen von N sind, daher sind %;; alle hypermaximal, ihr 
41=1,2,---,m\. : 
Fae 2m) sind mit 
einander vertauschbar. Da #;; hypermaximal ist, so gibt es fiir jede positive 
Zahl p einen Projektionsoperator R{?? in R derart, dass || &;,R{?’f || < p || RE&?’s | 
fiir jedes f in M, und = Ri? = M ist. Setzt man R™ = [] R§?’, 50 gilt 


Fi 
lim R = M. Tabet: iat R” + 0 far eine geniigend grosse, positive Zahl p, 


po 


und #;;R™ sind alle beschrinkt. Aus (*) erhalt man 


R” 9, = Rf, + oe + $,,R fy, 
a sald «ate coils SWORE pista We oe ee (m > n) 
RQ? Om = Pm fy + oss + Ona fy, 


Eigenring ist in ® enthalten, und je zwei der 2,,(3 


wobei R”y; ¥ 0 (i = 1, 2,---, m) ist, da alle o, go, --- , om eigentliche 
Elemente in bezug auf 9 sind. Daher verschwinden nicht alle Operatoren®,,R” 
identisch in §. Nun kann man wie in der Algebra beschrankte Operatoren \,, 
We, ---, Wm in der Form von Determinanten mit den Elementen &; ;R” ’ derart 
bestimmen, dass 


VR, + WR os + +++ + WR Om = 0 


ist, und nicht alle VR”, %R”,.-., Yak” identisch in M verschwinden. 

Da je zwei der gi , g2, +++ , Om cuskiiatlign eigentlich orthogonal sind, so kann 

man durch Pry ppt leicht einsehen, dass je zwei der WR 9, 
VeR gy, , +++, Vm om zueinander orthogonal sind. Daher muss 


WR”, = 0, VR” go, = 0, ++: ; VrR” om = 0 


sein. Danicht alle Operatoren ¥,R™, VR”, see, V,,R” identisch verschwin- 
den, kann man annehmen, dass ¥,R™ nicht identisch verschwindet. Dann 
gibt es einen Projektionsoperator R; (~0) in ®, sodass fiir jedes f in Mt aus 
Rif #0, UR® RE 6 0 folgt, da ¥:R® auch eine Funktion von N ist. Fir 
dieses R; gilt WR” Riv: = 0, folglich Ry: = 0, was unméglich ist, weil 
ein eigentliches Element in bezug auf 9 ist. 

Hivrssatz 4. Ein Ring R mit der Dimension « hat ein Basensystem fi, 
ee 

Bewels: f,, fe, --- seien in $ iiberall dicht. Alle Projektionsoperatoren F in 
®, fir welche Rf, = 0 ist, bilden einen Ring mit dem Maximaloperator 1. 
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Es gibt nach dem Hilfssatz 2 ein eigentliches Element g; in bezug auf dem 
Nebenring Ri®. gi = fi + Rigiist dann ein eigentliches Element in bezug auf ®, 
denn, da (Rf: , RRigi) = (RAsfi, ¢1) = O ist, folgt aus R(fi + Rigi) = 0 fiir ein R 
in RR = 0; RRe = 0, und weiter RR, = R, RR, = 0, d.h.R = 0. 

Bezeichnet man mit. K, den Projektionsoperator der von allen {R(f; + Rig:)} 
(Re§) aufgespannten, abgeschlossenen, linearen Mannigfaltigkeit, so ist der 
Nebenring (1 — Ki) mit R isomorph, denn, ware (1 — K,)R = 0 fir ein 
R(# 0) in , so miisste die Minimaldimension des Nebenrings KiR® gleich 1 sein, 
da f: + Rig: ein Basensystem von KiR¥ ist, was nach dem Satz 1 unméglich ist. 

Alle Projektionsoperatoren R in ®t, fir welche R(1 — K,)fe = 0 ist, bilden 
einen Ring mit dem Maximaloperator R,. Es gibt nach dem Hilfssatz 2 ein 
eigentliches Element ge in bezug auf dem Nebenring R2®. g2 = (1 — Ki) (fo +Rege) 
ist dann ein eigentliches Element in bezug auf ®, das zu fi + Rig: eigentlich 
orthogonal ist. 

Bezeichnet man mit Ke den Projektionsoperator der aus allen {R(1 — K,)ge} 
(Re®) aufgespannten, abgeschlossenen, linearen Mannigfaltigkeit, so kann 
man auch wie oben schliessen, dass [1 — (K; + Kz2)]9# mit ® isomorph ist. 
Abnlich erhalt man eine Folge von eigentlichen Elementen gi, gz, --- , von 
denen je zwei zueinander eigentlich orthogonal sind. Bezeichnet man mit M 
den Maximaloperator von t, so gehéren alle Mf, , Mf2,---, Mf, zu der von 
allen {Rg:, Rge,---, Rgn}(R eR) aufgespannten, abgeschlossenen, linearen 
Mannigfaltigkeit. Daher ist gi, ge,--- ein Basensystem des Rings 9. 


Bewels pes Satzes 5. N, und Ne seien zwei hypermaximale, normale 
Operatoren mit den Massoperatoren. #,(Z) und E,(Z). Jede Nullmenge 
iiber einem der E;(Z) sei auch Nullmenge iiber dem anderen, und Z,,, 
Z,, Z,, --» sei ein gemeinsames Spektralsystem von N; und N;. Wenn 
man die Eigenringe von N; und Ne bzw. mit R; und MR: bezeichnet, so besteht 
der Nebenring F(Z,,)®; aus allen {£,(Z)}(Z C Z,), und der Nebenring F2(Z,)R» 
aus allen {#2(Z)}(Z CG Zp)(n = ©, 1, 2,-++). fna,Sna,°**)Snm UNd Gna, 
§n2)***, Jn Seien Basensysteme bzw. von F,(Z,)%: und £2(Z,)R2 (n = 
*, 1, 2,.--). Fir beliebige beschriinkte Funktionen 

(Ns) (; eae te i) 
sind dann je zwei der $; (Ni)E.(Z;)f;,; zueinander orthogonal. Folglich bilden 
alle endlichen Summen aus ihnen eine lineare Mannigfaltigkeit tj. 1 ist 
iberall dicht in §, da Massoperator E,(Z) auch eine beschrinkte Funktion von 
Mist. Die zwei Mengenfunktionen || £,(Z)Ei(Zn)fn,, ||’ und || H2(Z)E2(Zn)gn,; |{" 
verschwinden fiir eine Menge Z beide zugleich, wenn eine von ihnen verschwindet. 
Daher gibt es eine messbare Funktion W,,;(z), so dass 


| Ey (Z)E(Zn)fnj \[? sad [ | W,;(z) Pd {| E,(Z) E2(Zn)9n.i ||? 


(/n = 0,1,2,--- ) 
\j= £2%-..°*? 
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ist.© Dann ist f,,; = Vn,;(N2)E2(Zn)gn.i (7 = 1, 2, «++, 2) auch ein Basensystem 
von E2(Zn)R2, und es gilt 

|| Fi(Z)Es(Zn)fn.i || = || B2(Z)Ea(Zn)fn.i |I. 
Wenn man jeder beliebigen, endlichen Summe f; =  & $1 j(N1)Ei(Zn)fn,; die 
entsprechende Summe fz = p> , ;(N2)Eo(Zn)fn.i suicuiiout so ist diese Zuord- 


nung langentreu, da 


fll’ = II > }yj(Ni)Ex(Za)fa.i ll = p> || Pas(N1) Ei (Zahn. ||? 


=E | lex@ fal MORE 


Ns) 


=D] lex@ fall BOBS 
= ©nj(N2) Ex(Zn)fn.i | = l|Fell 


Die > ®,;(N2)E2(Zn)f.,; sind tiberall dicht in §. Daher kann man diese 


Suualtens zu einer unitéren Zuordnung fe = Uf; eindentig erweitern. Fir 
jede charakteristische Funktion A(z) einer messbaren Menge Z ist E,(Z) = 
A(Ni), E2(Z) = A(Ne2). Daher erhalt man 


UE,(Z) { p> ©, j(Ni)Ex(Zn)fn,i} bie U} > Bnj(Ni)A(N1) Ei(Zn)fn.i} | 
= > ®y;(N2)A(N2)Ea(Zn)fn.i = Ex(Z) { > Baj(N2) Ea(Zn)fn.i} 
as E,(Z) U { »» @nj(NiEi(Zn)fn.i} “ 


Folglich gilt fiir jede messbare Menge Z 
E,(Z) = U*E,(Z)U, 


und hieraus kann man durch Eigenwertdarstellung leicht schliessen, dass 
N, = U*NQU ist. 





IMPERIAL UNiIveRsITY, Tokyo, JAPAN. 





* Radon-Nykodymscher Satz. Vgl.S. Saks, Theory of the Integral, Warszawa (1937), 8. 36. 
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UBER DEN BEWEIS DES STONESCHEN SATZES 


Von Hipecoré Nakano 
(Received February 13, 1940) 


Der wohlbekannte Stonesche’ 
Sarz. U,(— 0 <t < + ©) set eine Schar unitdrer Operatoren im Hilbert- 
schen Raum $. Wenn stets 


UU, we? Ut4e 


gilt, und fiir jedes f und g in S (U;f, g) als eine Funktion von t messbar ist, so gibt 
es eine Zerlegung der Einheit E(d), derart dass 


+o > 
U.=[ &* aE. 
wurde von J. von Neumann,” M. H. Stone,’ S. Bochner,‘ F. Riesz’ und B. von Sz. 
Nagy’ bewiesen. Unter diesen Beweisen beruht nur der von Neumannsche auf 
keinem Satz iiber Fouriersche Integrale oder Fouriersche Reihen. In dieser Ab- 
handlung soll der von Neumannsche Beweis wesentlich vereinfacht werden— 
insbesondere wird ein besonderer Beweis der Stetigkeit von U; in ¢ und die 
damit zusammenhangende Bezugnahme auf die reelle Funktionentheorie ver- 
mieden. 
Beweis. Wie von Neumann Zeigt, kann man durch 


(1) ne f. e*U;dt 


einen beschriinkten Operator erhalten. Es gilt dann weiter 


A* = F e'U, dt 


+00 
U,A = AU, = é | e'U,dt 


AA* =A*A, A+ A* = 2AA*. 
Ferner gilt: 
(5) aus Af=0 folgt f = 0. 


'M. H. Stone, Proce. Nat. Ac. 16, February 1930. §. 173-174. 
*J. von Neumann, Annals of Math. 33, 1932. §. 567-573. 
*M. H. Stone, Annals of Math. 33, 19382. S. 643-648. 

‘8. Bochner, Sitz. Preusz. Akad. 1933. 

*F. Riesz, Acta Sci. Math. Szeged, 6, 1934. 

*B. von Sz. Nagy, Math. Annalen 112, 1936. S. 291-294. 
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Denn wegen (3) folgt fiir jedes g 
+00 
[ Ws, pa =0. 


Da s beliebig ist, gilt (U:f, g) = 0 bis auf eine Nullmenge: in ¢t. Dann kann 
man nach Separabilitat von § leicht beweisen, dass bis auf eine Nullmenge 
in ¢ fiir alle g gleichzeitig (U.f, g) = 0 dh. Uf = 0, folglich ||f|| = 
|| Usf || = 0. 

Sei nun 


(6) W = -2A +1. 


Dann folgt aus (4) W*W = WW* = 1, d.h. W ist unitér. Und wegen (5) 

folgt aus Wf = f,f = 0. Daher ist W nach J. von Neumann,’ 8. 91, 92 die 

Cayleysche Transformierte eines hypermaximalen Hermiteschen Operators 
+00 


H = \dE(a), 


fiir eine geeignete Zerlegung der Finheit E(A). Die ebendort durchgefiihrten 
Rechnungen ergeben weiterhin 


X14 
wef 3 + 
Im Hinblick auf (6) folgt hieraus 


(7) A= ‘i 5 dev. 


Setzt man 


View f. e d(x) 


fiir diese Zerlegung der Einheit E(A), so ist V; unitar und stetig in ¢, und es gt 
(8) Vi+e = V; VY, 


(9) Am [ ” Vat 


0 
(10) A* =| eV. dt 


+00 
(11) Via a at ey / eV, dt. 


Setzt man fiir eine positive Zahl n(>1) E, = E(n) — E(—n), und bezeichnet 
den Wertvorrat von E, mit QM, , so gilt fiir jedes f in M, nach (7) 


A= | ao. 





7 J. von Neumann, Math. Annalen 102, 1929. 8S. 49-131. 
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UBER DEN BEWEIS DES STONESCHEN SATZES 
Infolgedessen definiert in Mt, 

A‘ = [ (1 — A) dE() 
eine Reciproke A von A, die ausserdem beschrankt ist: Fir alle f in M, ist 


sal 1 
ANSI S 2miifl, AFI 2 SII. 
(1), (3), (9), (11) verbindend kann man auch schreiben, dass 


AU, =eA — é | e 'U,dt 


AV,=¢A-— é | e ‘Vi dt, 
daher 
A Se mae I e(U, — V,) dt. 


Da die abgeschlossene lineare Mannigfaltigkeit I, beide unitire Operatoren 
U, und V, reduziert, so gilt fiir jede positive Zahl s 


1 \\(U.— Vasil $ AW. — Vasil se [ el . - vasa? 
0 
Setat man g(s) = e* || (U, — V,)f ||, so ist 


1 
12 Fc 
(12) Os 5, 79) < | g(t) dt, 
folglich 


= ; e ™*o(s)ds < [ jem] [ g(t) ax}} ds 


1 —2ns ? 1 4 —2ns 
= an? I g(t) dt +5 | e* (s) ds, 


und somit 


2n 


Daher muss nach (12) g(s) = 0 sein, d.h. U,f = V.f fiir jedes fin M,. Fir 
beliebiges f in © gilt auch U.E,f = V.E,f. Da lim E, = 1 ist, erhilt man 


no 


1 pin i] g(t) dt = 0. 


U.f = V.f, was zu beweisen war. 
Hier danke ich Herrn Prof. J. von Neumann fiir freundliche Verbesserungen 
und Bemerkungen. 





‘Wenn (4/, 9) = | (Waf, 9) dt ist, so gilt 


| 

ai ois | f Lams o Lael =| fms ite hae = the f Awe tae). 
Setzt man g = Af, so erhilt man 

Waris | fives tia. 
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ON THE TOPOLOGICAL STRUCTURE OF SOLVABLE GROUPS 
By CLaupE CHEVALLEY 


(Received November 1, 1940) 


E. Cartan has shown that a simply connected solvable Lie group is homeo- 
morphic to some cartesian space.’ We want to investigate the structure of a 
solvable Lie group which is not necessarily simply connected. 

From a well known theorem,” it follows that such a group may be considered 
as the factor group of a solvable simply connected group G by a discrete sub- 
group D of the center of G. Therefore what we have to do is to look for the 
possible “situation” of such a sub-group D in the group G. 


I. Norations. GENERALITIES 


G being a Lie group, to every element L of its Lie algebra % there corresponds 
a one parametric sub-group o = a(t) of G: we have o(t; + t&) = o(t)o(t), and L 
is the tangent vector at the unit element to the differentiable curve described 
by a(t). Moreover, replacing L by AL, where ) is a real number, does not change 


the geometric curve a(t), but replaces the parameter ¢ by xt 


Extending to general Lie groups a notion which is well known for matrices, 
we shall denote by exp L the point of parameter 1 on the curve a(t). Therefore 
we have (exp tL) (exp t’L) = exp (¢ + ?’)L. 

If we transform the elements of the group exp Lt by a fixed element o, we 
obtain a new one-parametric group exp L’t; the mapping L — L’ is a linear 
mapping A, of the Lie algebra & into itself; the mapping o — A, is a linear repre- 
sentation of the group G, which is called the adjoint representation. Any central 
element of G is obviously represented by the identity operation. 

Suppose that o describes itself a new one parametric group exp Mu. Then 
L’ becomes a function of u, and we have L’ = L + u[M, L] + --- , the terms 
not written being of order > 1. 

The mapping L — [M, L], for M fixed, is a linear operation Ay in %; the map- 
ping M — Ax is called the adjoint representation’ of the Lie algebra; we have 


Au) = AwAu — AuAw . 





1 It follows at once from the lemma proved by E. Cartan on page 20 of the pamphlet 
“La topologie des groupes de Lie’”’ (Exposes de geometrie, Act. scient. et indus. no. 358, 
Hermann et Cie, Paris, 1936). See also Theorem 83, p. 277 in Pontrjagin, “Topological 
groups” (Princeton University Press, 1939), Theorem 61, p. 225. 

* See Pontrjagin, |. c. 1), Theorem 83, p. 277. 

*Our notations do not coincide entirely with the usual ones. It is the mapping 
M — —Ay which is generally called the adjoint representation. 
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Let H be a closed invariant sub-group of G; it is also a Lie group, and its Lie 
algebra is an invariant sub-algebra—or ideal—in &. Moreover G/H is also 
a Lie group, whose Lie algebra is the quotient algebra °/. If L, L’ are two 
elements of 2, and if L is a congruent to L’ modulo §, i.e. if L’ — L ¢ S then the 
elements exp L, exp L’ belong to the same coset of G modulo H. 


II. SotvaBLeE Groups / 


A Lie algebra & is solvable* when there exists a chain § C $C --- C §, of 
ideals in 2, beginning with Sp = {0}, ending with §, = &, such that the factor 
algebras ©:/Oi.a (1 S 7 S 1) are all abelian. Every sub-algebra or factor- 
algebra of a solvable Lie algebra is again solvable. 

The derived algebra &’ of a solvable Lie algebra 2 = &, of dimension n > 0 
is always different from %. Since %/{’ is abelian, every linear sub-space of 2 
containing %’ is an ideal, and we conclude that % has an ideal %,,_; of dimension 
n—-1. The algebra &,-: is also solvable if n — 1 > 0; it has itself an ideal 
Q,-2 of dimension n — 2, and we may continue the process. Hence we have a 
chain® = {0} C&UCRCc--- Cc & = L of sub-algebras of &, where &; is of 
dimension 7 and contained as an ideal in &;,; (although &; is in general not an 
ideal in 2). If for every 7 (1 S 7 S n) we choose an element L; belonging to &; 
but not to &;1 , the m elements LZ, , Ze, --- , Ln constitute a base for &. 

Suppose now that & is the Lie algebra of a connected simply connected group 
G. Then every element in G can be written in one and only one way in the 
form (exp 4J,1)(exp &Le) --- (exp tpn), where t:,%,---,t¢, are real num- 
bers.” Let 9 be any linear sub-space of &. We may choose the elements 
I,, I,,-+- , I, in such a way as to satisfy the following condition: if there 
exists in %; an element of 2% which does not belong to %;_; , L; shall be chosen 
inM. If Li, = Mi, Li, = Mz,---, Li, = M, are the elements L; which 
belong to Mt, the elements M,, M;, --- , M, obviously form a base for M. 

Suppose now that Mt is a sub-algebra. Let k, 1 be two indices such that 
13k <I1sr; then [M;,, M,] is a linear combination of Mi, M2,---, M,; 
on the other hand, it is also a linear combination of Li, , Li, , --- , Liy_; ; 
hence it is a linear combination of M,,M:,---,Mii. It follows that 
M,, M2, --- , M; constitute a base of a sub-algebra I, of Mt of dimension |, 
and that Nt,1 is contained as an ideal in Mt, : the base M,, Mz, --- , M, has 
bir respect to Dt the same properties as the base Li , Lz, --- , L» with respect 
to % 

Suppose that 2 is the Lie algebra of a simply connected group G. Then the 
elements (exp %4M;)(exp wMz) --- (exp u-M,) (wi, ue, +++ , &, real) constitute 
a closed sub-set M of G, homeomorphic to the r-dimensional cartesian space. 
On the other hand, we know that there exists an abstract Lie group M’ whose 
Lie algebra is 2 and that every element in M’ can be written in one and only 





‘Or integrable in the sense of 8. Lie or E. Cartan. 
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one way in the form (exp uiM)) --+ (exp u,M,). It follows at once that M is 
a sub-group of G. 

Therefore: If G is a simply connected solvable Lie group, to every sub-algebra In 
of the Lie algebra of G there corresponds a closed simply connected sub-group M of 
G whose Lie algebra is M. 


III. Stupy or A ParticuLtaR Type or SOLVABLE GRouP 


Let G be a simply connected solvable Lie group which has the following 
properties :° 

1) The Lie algebra & of G, of dimension n, has an abelian ideal % of dimension 
n—-1. 

2) There exists an element o of the center of G which does not belong to the 
connected sub-group @ of G whose Lie algebra is 2%. 

We shall first prove that there exists an element L ¢ % such that exp L = o. 
In fact the group G/A is an abelian connected group of dimension 1; hence there 
exists an element L of 2/% such that exp L = a, where ¢ is the coset modulo @ 
which contains ¢. If L; is any element of 2 whose coset modulo & is LZ, we have 
exp L; = on, n €@. If tis any real number, exp ¢L; commutes with exp J, and 
with o, hence also with 7. On the other hand, @ being abelian and connected, 
there exists one and only one element A ¢ % such that 7 = exp A. In the 
adjoint representation, the operation which corresponds to exp tL, changes A 
into an element A; such that exp A; = 7 = exp A; hence A; = A, which means 
that [AZ] = 0; it follows that exp (LZ; — A) = exp L,; (exp — A) = o; we may 
take L = L, — A. 

Consider now the group & composed of the elements exp tL. Since exp L 
belongs to the center, the adjoint representation maps © onto a group of linear 
operators in % which is compact. The operators of this group permute among 
themselves the elements of %; since the group is compact, we have complete 
reducibility,° i.e. we can find a decomposition % = %, + %+--- + Mof A 
into a direct sum of linear spaces % , %,--- , %, each one of which gives an 
irreducible representation of ©. Since G is a one dimensional group (isomorphic 
to the additive group of real numbers), each YI; is of dimension 1 or 2; suppose 
that %1, %,---, % are of dimension 2, %.s:, --- , Xs being of dimension 1. 

The operation A, which corresponds to L in the adjoint representation per- 
mutes among themselves the elements of %; ; let \; be one of its characteristic 
roots in %;. Then e“ is a character'stic root of the operation which represents 
exp L = o; hence e’ = 1, A; = 2xnix/—1, where n, is an integer. If ni # 0, 
\; is not real, and —), is also a characteristic root of Az in Y; ; hence dim Yi = 2; 
if n; = 0 on the contrary, we have dim A; = 1. 





5 The simplest example of such a group is the simply connected covering group of the 
group of motions in the plane. 
° Cf. Pontrjagin, 1. c. 2), Theorem 23, p. 110 and I, p. 109. 
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Moreover, if « $k we can find a base A;, B; of YA; such that A,(A; + 
o/—1 Bi) = 2anir/=1 (Ai + V—1 Bi), ie. 


[A;, L] = —2xn,B; 


[By L] = 2un.A; (n; # 0). 


Ifi > k, we have 
[A;, L] = 0. 


Hence the structure of 2 is entirely determined when the integers n; are known; 
the linear space %i41 + --- + Wis the center of &, and % + %+---+ W% 
is the derived algebra of &. 

It is easy to build from these data a system of finite equations for the group. 
We have n variables t, u;, v; (1 S 7 S k), wi (k < i S h) which constitute a 
system of coordinates on the group; o; , o2 being any two elements of the group, 
the law of composition is defined by the formulas 


t(o1 , 02) = t(o1) + t(o2) wi(oro2) = exp (—2rnir~/—1 t(o2)) — (wi(or) + wi(or)) 


where we have set ws = uz + +/—10,if 1 $i Sk.’ The coordinates of o are 
(1, 0, rae 0). 

We can also find the explicit formulas for the coordinates of exp X, if 
X = aL + Yin (iA: + 1B) + Doty aA; : we find 





t(exp X) = a 
flsisck 
1 —exp(—2rnv/—1a) . 
; —1b ——— if nid) ¥ O 
wiexp X) = 1% +'V—1 b) SRR re 1:0 
a+vV¥-—1); if na = 0 
fi>k wi(exp X) = a;. 


Hence, the elements X for which the equality exp X = o holds are those of the 
form L + >-i (a:A; + b,B,), and only those. 

We shall need later the following consequence of these facts: let + be any 
automorphism of the group G which leaves unchanged the element o; if we 
denote by A, the linear operation® on 2 which corresponds to 7, A, permutes among 
themselves the elements of the sub-algebra Dt generated by L and the elements 
A;, B; for 1 < i S k; if X is any element of I such that X = L (mod Y%), we 
have again exp X = go. 





"In order to simplify the writing of formulas, we are using complex valued functions w; , 
although the group under consideration is of course a real Lie group. 
*This operation is defined just as in the case of the adjoint-representation. 
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IV. Tue Centra, THEOREM 


We want to prove the following result: 

TurorEM 1. Let G be a connected simply connected solvable group, with the 
Lie algebra 2. Let D be a discrete sub-group of the center of G. Then D isa 
free abelian group of a certain rank r at most equal to the dimension n of G. It 
is possible to choose a base L, , Le, --+ , Ln of which has the following properties: 

1) Every element in G can be written in one and only one way in the 
form (exp tL;)(exp tL2) --- (exp tnLn). 

2) There exists a subset {i,, t2,--+ , tr} of the set {1, 2,---, m} such that the 
elements exp Li, , exp Li,,-+- , exp Li, constitute a base for D, and that [L;, , 
Li,] = 0(1 s Qa, Bs r). 

We shall prove this theorem by induction on the dimension n of &%. Let us 
assume that it has been proved already for all dimensions < n. 

Since & is solvable, it has certainly an abelian ideal of dimension > 0; let % be 
an abelian ideal of maximal dimension; we set {* = &%/%. Let A be the con- 
nected abelian sub-group of G whose Lie algebra is %; A is an invariant sub- 
group, and G/A is a solvable group of dimension < n. The group DA/A is 
a sub-group of the center of G/A. 

Consider now the closure DA of the group DA. The connected component 
of the unit element « in this closure is a closed sub-group of G; according to a 
well known theorem of E. Cartan, it is a Lie group; let %’ be its Lie algebra. 
The group DA being abelian and invariant, 2’ is an abelian ideal in % containing 
%; in virtue of our choice of %, we have 2%’ = %, which shows that the connected 
component of ein DA reduces to A. On the other hand, there is a neighborhood 
of ein DA which is compact; this neighborhood can meet only a finite number 
of distinct connected components of DA, from which it follows at once that 


there exists a neighborhood U of ein G such that UM DAC A. Any element 
of DA which belongs to the set UA already belongs to A; hence the group DA/A 
is a discrete sub-group of G/A. It follows that we can apply our theorem to 
G/A and DA/A: we obtain a base {Li , Lz? , --- , La} of &/% with the following 
properties: 

A) Every element of G/A can be written in one and only one way in the 
form (exp tL1)(exp us) - (exp tmln). 

B) There exists a subset ti prc, ig} of the set {1, 2, --- , m} such that 
the elements exp. Li, , exp Li, , ---, exp Li, constitute a Sein for the free 
discrete group DA/A, and that [Li y Li,] =0O(1 <5 a,68 S88). 

For each a, we can find an element o4 €D such that its coset «, modulo A 
coincides with exp Ly. 

We want now to ines i in a certain way elements J; , Ls corey =. & which 
will have to satisfy the following conditions: A) The bbe ‘of L; modulo 
M is Le (1 Si Sm); B) we have exp L,, =o;,, (lS ads); C) we have 
[Li,, Li] = 0(1 S a, BS 8). 


Suppose that we have already chosen L,, Le L, in such a way as to 
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satisfy the conditions of the type A), B), C) which concern these elements. We 
want to choose Lay: (If h = 0, we assume simply that no choice has been made 
yet 

4 + 1 does not occur among the indices 41, 12, +++, t we choose for Lay; 
any element of 2 whose coset modulo & is Lr. 

Suppose now that h + 1 = iu4,. We begin by choosing an L;,, whose coset 

modulo % is Lays , and we consider the Lie algebra $ spanned by % and Liy: ; ' 
let H be the corresponding connected subgroup of G. Itisa closed simply con- 
nected sub-group of G. Moreover exp Lj4, lies in the coset exp Lis modulo A; 
hence exp Liga = 04,4, modulo A, which proves that Firs € H; since o;:,,, ¢A, 
H is a group of the kind which lina been considered in §III. It follows first 
that 5 contains an element Lj41 such that Lys = Lis (modulo %) and that 
exp Liu = 0%,,,- Since we have [Lé, , Lol =0 (1 Sa S&S ss), we have 
(Li,, Lin) e U(1 Sy St). Since [Li,, Li,)] =O for 1 S 7,7’ S t, the elements 
(exp wLi,) -- - (exp u;,Li,) constitute an sbeliasi sub-group of G. Since Y is an 
ideal and [Li, , Lis] € U, they are represented in the adjoint representation by 
linear operations which permute among themselves the elements of § and Y. 
The elements exp L;, (1 S y S #4) being represented by the unit operation, we 
obtain in this way " somepant abelian group @ of linear transformations of 
into itself. 

It follows from §III that § can be represented in the form §; X 3, where 
3 is in the center of § and Lia: € S:. Moreover the decomposition has, as we 
saw, the two further properties that: 1) if X « :, X = Lps:(mod ), we have 
exp X = exp Liz: = o%,,, 2) any automorphism of H which leaves o;,,, invariant 
gives a linear operation in § which permutes among themselves the elements 
of G1 : 

This applies in particular to the inner automorphisms of G produced by the 
elements (exp mLi,)(exp weLi,) --- (exp uel:,). Hence the operations of the 
group % permute among themselves the elements of $: . 

But $: M Y is a linear sub-space of §, whose dimension is exactly one less 
than the dimension of §,. Its elements are also permuted among themselves 
by the operations of G. Since G is compact, it is completely reducible, and we 
may conclude that there exists an element Lis; € $1 with the following properties: 
1) Lis: = Liys (mod %); 2) the elements of G permute among themselves the 
multiples (Ly; of Lnyi. Again because G is compact, it follows that its opera- 
tions leave Ly; invariant; if we remember the definition of G, we see that 
“ij InJ =O sys t). Finally, the conditions Lis: € $1, Lan = Lins 
(mod %) imply exp Lay: = exp Lixs = o%,,,- Hence the element Li, satisfies 
all our requirements. 

In this way, we shall be able to select m preaches Ii, Le, +++ , Lm satisfying 
the conditions A, B, C. 

On the other hand, the group A is isomorphic to the additive group of R"””, 
and Df) A is a discrete fe narene of A, i.e. a lattice. It follows that we can 
find elements Lmja, +++, Dn in & such that every element of A can be written 
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in one and only one way in the form (exp tmyilms) --- (exp tL.) and that 
there exists a subset {7.4:,---, 7} of the set {m + 1, --- , m} such that the 
elements exp Li,,,,--- , exp Li, constitute a base for D / A. 

It remains only to prove that the sets {Z;, Iz, --- , Ln} and {t,, i, --- , i,} 
which have been selected have the properties 1), 2) of our theorem. 

Let o be any element of G and o* be its coset modulo A. Then we can write 
o* in the form (exp &L7) --- (exp tmL2). But this product represents a coset 
modulo A which contains the element (exp t:J1)(exp &L2) --- (exp tmlim) since 
Lm belongs to the coset L;, modulo %. Hence we have 


(1) o = (exp 4L;)(exp Le) -- (exp tmLm)f teA 


and ¢ can be written in the form (exp tmiiDm4) --+ (exp trLn), which proves 
that o is equal to an element of the form (exp éJ)(exp #Le) --- (exp tL). 
Moreover t;, t2, --- , 4m are uniquely determined by o* (and a fortiori by o); 
hence ¢ is determined by o and also tmyi,--+ , tn. 

If oe D, we have ote DA/A and therefore o* can be written in the form 
(exp mi, Li, ) +++ (exp m;,L;,), with integral m;,,---, m;:,. Since the element 
(exp m;,Li,) --+ (exp mi,Li,) = o7'07;'2 --+ of + belongs to D, the element ¢ of 
formula (1) belongs to D and can be written in the form (exp mi,,,Li,,,) ++: 
(exp m;,L;,), which proves that the elements exp Li, , exp Li, ,--- , exp Li, 
constitute a base for D. 

We have already seen that [Z;, , Li,] = Ofor1 < a,8 Ss. Weshall see that, 
for a > s, L;, necessarily belongs to the center of %. In fact, let us consider an 
arbitrary operation of the adjoint representation of G; it changes L,, in an 
element L;, which still belongs to %; since ¢, = exp L;, belongs to the center 
we have exp L;, = exp L;,. Since A is an abelian simply connected group, 
we conclude that L;, = L;, , which proves that L;, belongs to the center of ¥. 
This completes the proof of our central theorem. 


Remark. Wemay assume that the set {7i; , 2, --- , 7,} is the set {1, 2, ---,7}- 
We again prove it by induction on n. Suppose that we have {i:, i, --- ,%} = 
{1, 2,---, s}. We have seen that L;,,,,---, Li, belong to the center of &. 


It follows that any element exp ¢,;,L;, (a > s) belongs to the center of G. Hence 
the property 1 stated in the theorem will still hold if we operate a permutation 
of L,, Le, --- , Ln which brings L;, at e-th place (a = s + 1, --- , 7), and this 
proves our assertion. 

TurorEeM 9. Let G* be a connected solvable Lie group. There exists a compact 
abelian sub-group T of G* and a sub-set E of G*, homeomorphic to a cartesian space 
in a certain number of dimensions, such that every element o € G* can be written in 
one and only one way in the form « = rn, re T,n€E. The mapping (7,1) 7™ 
is a homeomorphism of T X E with G*. 

In fact we have G* = G/D where G is a simply oonnéoted solvable Lie group 
and D a discrete sub-group of the center of G. We can find a base 
{I1, Le, --+, L»} of the Lie algebra of G such that every element in G can be 
written in one and only one way in the form (exp t,J;) ---+ (exp tnLn), and that 


th 
th 
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the elements exp I, , exp Lz, --- , exp L, constitute a base for D. Moreover 
the group A of the elements (exp 41) --- (exp ¢,,) is abelian. 

We have D Cc A and A/D is a compact abelian sub-group T' of G*. 

The elements (exp 4x) --- (exp t,Ln) and (exp tL;) --- (exp t,.L,) are con= 
gruent modulo D if and only if | = t, (mod 1),---,& = t, (mod 1), ta = 
tut, ++, tn = t. Let yg be the natural mapping of G onto G*. The set FE of 
the elements of the form (exp t,4:0,41) --- (exp t,L,) is mapped by ¢ in one-to-one 
continuous way onto a subset EF of G*. We claim that this mapping is a 
homeomorphism. In fact, let UO be the set of the elements (exp tl,) --- 
(exp tpl) with |i] < 4,---,|&| < 4; 0 is a neighborhood of £ in G and is 
mapped in a one-to-one continuous way by ¢; but we know that ¢ maps any 
open subset of G onto an open subset of G*; hence ¢ gives a homeomorphism of 
U with a subset of G*, which proves our assertion. 

It follows that E is homeomorphic to the (n — r)-dimensional cartesian space. 
Moreover every element of G* can obviously in one and only one way be written 
in the form rn, Te T, ne ZL. 

Since the mapping (7, ») — 17 clearly maps open sets of 7’ X E onto open 
sets of G*, it is a homeomorphism, and theorem 2 is completely proved. 

Since a compact abelian group is a torus-space (product of circumferences) 
we have the 

THEOREM 2°. Any connected solvable Lie group is homeomorphic to the product 
of a torus space and of a cartesian space. 
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ALTERNATIVE RINGS AND RELATED QUESTIONS I: EXISTENCE OF 
THE RADICAL 


By Max Zorn 
(Received July 29, 1940) : 


In a previous publication (1) I have presented a theory of alternative rings 
which led up'to the characterization of semisimple systems. The present paper 
is concerned with general hypercomplex’ alternative rings; its main object is to 
establish the existence of the radical. In other words it is shown that the 
properly nilpotent elements form a (two-sided) ideal. 

The first section contains the proof of several alternative identities,’ which 
are then applied (sec. 2) to the theory of the inverse element. Though a certain 
amount of duplication’ could not be avoided, the new arrangement has been 
given in detail because it is phrased almost entirely in terms of “associators” 
(a, b, c) = (ab)c — a(be). A new proof of the O-lemma‘* has likewise been in- 
cluded; the lemma is now obtained by inspection of an identity, and the assump- 
tion about characteristics is removed. This is of interest in connection with the 
theory of Peirce. decompositions, which was derived in (1) under the (now 
unnecessary) restriction that the characteristics be different from 2. 

Thus far the theory is entirely self-contained; this is likewise achieved in the 
third section by formulating two theorems from (1) as assumptions. These 
two propositions are indeed so simple that they may very well serve as axioms. 
On this basis it is possible to treat the rings which in the associative theory have 
been called ‘“vollstaendig primaer”;’ the existence of the radical is verified in 
this important special case. 

In the last two sections use is made of the theory of idempotents and Peirce 
decompositions as developed in (1); the necessary material is very briefly re- 
viewed in sec. 4. A certain ideal N is constructed in the fifth section, which 
depends formally on an arbitrary but fixed maximal Peirce decomposition. The 
concluding section contains the proof that this ideal consists exactly of all 
properly nilpotent elements and may therefore be regarded as the radical of 

the ring. 
_ Norations. With one exception (the delta in the Kronecker symbol 6;;) we 
use only italics; 2, j,'k, 1, m, n for indices and the like, a, b, c, d, ¢, 2, Yik 5 2% for 
elements, capitals S, Sx, Ni, N, A, B for sets. 





1 See sec. 4 and also (1) p. 137. 

*I am indebted to P. Jordan for many remarks concerning such and more general 
identities. 

* See (1) p. 142. 

4 See (1) pp. 129, 130. 

5 See (2) p. 256 and (3) p. 85. 
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The summation sign is used in such a way that the range of the index is: 
described in a parenthesis which precedes the letter >>. 

Theorems and definitions are numbered decimally within the section and the 
number of the section is prefixed; a theorem is indicated by a parenthesis— 
(3.7.1)—and a definition by a bracket—{1.1}—. A single numeral in a paren- 
thesis, like (2), refers to the bibliography. 










1. We shall constantly have to use the abbreviation: 
(1.1) (a, b, c) = (ab)e — a(be). 
These “‘associators” satisfy in every distributive ring the identity 
(1.2) (ab, ¢, d) — (a, be, d) + (a, b, cd) = a(b, c, d) + (a, b, c)d 


which becomes trivial when the associators are worked out according to [1.1]. 
Let us permute the arguments cyclically such that 6 is the first letter, and 
add the resulting equation to (1.2); we get 


(1.3) (ab, ¢, d) + (b, ¢, da) + [(be, d, a) — (a, be, d)] + [(a, b, ed) — (b, ed, a)! 
= a(b, c, d) + (a, b, c)d + bc, d, a) + (b, c, dja. 


From now on we ask that multiplication in the distributive ring satisfy the 
alternative law, which can be expressed in different ways: 

[1.4] A ring is called alternative and said to satisfy the alternative law of multi- 
plication if the associator (a, b, c) is an alternating function of its arguments. 
The associator must therefore be zero if two of the arguments are equal; it takes 
on the factor —1 if two of the arguments are transposed and it remains un- 
changed if the arguments undergo a cyclic permutation: 


(14.1) (a,a,b) = (a, b, a) = (6, a, a) = 0, 
(a, b,c) + (b, a, c) = (a, b, c) + (a, ¢, b) = (a, b,c) + (¢, b, a) = 0, 
(a, b,c) = (b, c, a) = (¢, a, b). 


The left hand member of the equation (1.3) would consequently reduce to 
(ab, c, d) + (b, c, da); if we put b = c, most of the terms will drop out and the 
equation (ab, b, d) = b(b, d, a) remains. Exchanging a and b and applying 
the alternative law we get the identity 


(1.6) (a, ba, c) = —a(b, a, ¢).° 
We add this equation to 
(ab, a, c) — (a, ba, c) + (a, b, ac) = a(b, a, c) + (a, b, ajc 






























(1) p. 142, 
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‘such as to get 
(1.7) i tad ' (ab, a, c) + (a, b, ac) = 0. 


On this sdentity we shall base, in the following section, the theory of the inverse 
- element. 
From (1.7) we pass to the sii identity 


(1.8) (ab, a, c) + (ba, a, c) 54 (6, a’, ¢) = 0 
by way of adding 
(ba, 4): Sorts ) +O, a, ac) = ba, a, ) + 0, seie re. 8. 


2. The theory of Peiree decompositions, which was given in (1), rests on the 
“O-lemma”’ : 
(2.1) .If ab =. ba = 0. then for every c the associator (a’, b, c) vanishes. 
Without any restriction this lemma follows by substitution from the identity 
(1.8); thus we are entitled to apply all results of (1) about idempotent elements 
, Without reservation. This will be of importance in the concluding sections. 

‘For the remainder of this section we deal with the theory of the inverse and 
assume consequently the existence of a unit element. 


(2.2). If ab:= ba’ = 1, then all esmntialtin of the forms. (a? b, > and (a, b’, c) 
are zero. 
In order to extend this to associators of the type (a, b, nt we replace b by b 
in the formula (1. 1): 


ag LRA ny ie a 4 Gale 


Since in me with the alternative law the equation ab = 1 implies that 
ab” = a(bb) =(ab)b = 1b = b, we get from (2.2) and (2.3) the theorem: 

(2.4) If the numbers a and b satisfy the equations ab = ba = 1, then every asso- 
ciator of the Sorm{a, b,.¢) vanishes.” 

The element b (if it exists) has all properties of an inverse element for a; it is 
unique and may be used for the solution of linear equations: 

(2.5) If ab = ba = 1, then the equations ax = c and ya = c have the solutions 
= be and y = cb; these solutions are unique. 

Indeed, the preceding theorem permits us to proceed as is customary for asso- 
ciative systems. The existence of the solutions follows from formulas like 
a(be) = (ab)c — (a, b, c) = le — 0 = c¢, and the uniqueness is exhibited by 
similar equations like ch = (ya)b = y(ab) + (y, a, b) = yl +0 = 

In particular, by considering the cases c = 1 and c = 0, we obtain the 
corollary: 

(2.6) Ifab = ba = 1, thenb is the unique solution of either one of the two equations 
' ax = 1 or ya: = 1; the element a is not a divisor of zero. 





7 Compare (1) p. 142, lemma 2. 
8 For additional information about the inverse in alternative fields see (4). 
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This justifies the definition: 
[2.7] If there exists an element b satisfying ab = ba = 1, then it is called the 
inverse of a and denoted by a”. 


‘This definition and the foregoing results permit us to state: 


(2.8) aa'=aa=1; (@')'=a; (a,a",c)=0. 


3. In order to postpone as far as possible any reference to outside material 
we introduce the following two assumptions. 


(3.1) Any two elements a, b of an alternative ring generate, by multiplications and. 


additions, an associative subring. 

That this assumption is satisfied in every alternative ring has been shown by 
Artin;’ one particular consequence is that the powers a” with positive exponents 
are commutative and associative with each other. : 

(3.2) An element a is either nilpotent, or else there exists a right multiple e = ab 
which is idempotent, such that e” = e ¥ 0."° 

[3.3] Dertnirion. An alternative ring will be called primitive if (a) it con- 

tains a unit element 1 which is different from 0; (b) the unit element is the 

only idempotent of the ring; (c) it satisfies the two above assumptions (3.1) and 

(3.2). 

In defining next the properly nilpotent elements we deviate from the classical 
definition” by including 0. This is done because otherwise we would always 
have to say “properly nilpotent or zero.” 

[3.4] An element a is called properly nilpotent if all multiples az are nilpotent. 
By virtue of (3.1) we see that (ax)” = 0 implies (ra)"** = x(ax)"a = 0, such 
that also the left multiples of a are nilpotent; we shall have to use this remark 
later. 

(3.5) If a multiple ax has an inverse then the element a is not nilpotent.” 

Let a” ~# 0 = a”; then 0 ¥ a” = a”™ “((az)(ax)') = (a” ‘(az))(ax)* = 
(a"r)(ar)* shows that a” = 0 is impossible. 

The theorems (3.6)—(3.9) describe some important properties of primitive rings. 
(3.6) If an element a of a primitive ring is not nilpotent then it has an inverse. 
Indeed there must be a multiple ab which is idempotent, by virtue of (3.2). 
This idempotent must be equal to 1 since our ring is primitive; the theorem will 
be proved if we show that ba = 1. . 

First of all we see that ba is not equal to zero; for in that case 1 = 1° = 
(ab)(ab) = a(ba)b = 0 would ensue.” Secondly ba is idempotent because of 
(ba)(ba) = b(ab)a = (bl)a = ba; consequently we have ba = 1 and b is the 
Inverse of a. 





*See (1) p. 127. 
. This assumption appears in (1) p. 138 as a theorem, valid in hypercomplex rings and 
incidentally based on the first and a part of the second chain condition. 

“See (3) p. 46. 

Compare (1) p. 142. 

* This part was omitted in (1) p. 142. 
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(3.7) Every nilpotent element of a primitive ring is properly nilpotent. 
(3.7.1) Corotuary. Every element of a primitive ring is either properly nil- 
potent or it has an inverse. 

In other words if a is nilpotent then every multiple is nilpotent. That is an 
immediate consequence of (3.5) and (3.6). For if—to conclude indirectly—one 
multiple ax were not nilpotent it would possess an inverse and (3.5) imply that a 
is not nilpotent. Using this and the remark about [3.4] we find: 

(3.7.2) If an element a of a primitive ring is properly nilpotent, every multiple 

ax and xa is properly nilpotent. 

Thus one of the two essential properties of two-sided ideals is verified for the 
set of all properly nilpotent elements. It remains to show that this set is a 
modulus (i.e. closed under addition and subtraction). This is accomplished if 
we prove the theorem: 

(3.8) If a and b are two properly nilpotent elements - a primitive ring, then the 

the difference a — b ts likewise properly nilpotent. 

An equivalent proposition would be: 

(3.8.1) If the element b of a primitive ring is properly nilpotent, and the difference 
a — bis not, then the element a is not (properly) nilpotent. 

For in that case the difference a — b has in inverse; let (a — b)x = ax — bx = 1, 

so that ax = 1 + bx. Since br is nilpotent, with say (bx)” = 0, the element 

1 + bz has the inverse (¢ = 0 --- m) Xi (—bz)’. Thus the element a is seen to 

have a multiple with an inverse; and that precludes, according to the theorem 

(3.5), the possibility that a be nilpotent. 

If we observe finally that the set of all properly nilpotent elements is not 
empty we may enunciate the theorem: 

(3.9) The properly nilpotent elements of a primitive ring form a two-sided ideal. 

We call this ideal the radical of the primitive ring and define more generally: 
[3.10] If the properly nilpotent elements of an alternative ring form an ideal, 

then this ideal is called the radical of the ring in question. 

We note without proof: 

(3.11) The residue class ring modulo the radical (if it exists) is semisimple; 
(3.12) The residue class ring modulo the radical of a primitive ring is an alter- 
native field; 
and 
(3.12.1) If a primitive ring does not contain properly nilpotent elements different 
from zero, or in other words if the zero ideal is the radical, then the ring 
as an alternative field. 

We intend to establish the existence of a radical for all hypercomplex rings; 
for that reason we state as an immediate consequence of (3.2) and [3.10] the fact: 
(3.13) If a ring does not contain any idempotent elements, it possesses a radical, 

which coincides with the ring itself. 


4. In order to complete our proof we shall have to use some of the definitions 
and theorems which were developed in (1). 
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[4.1.1] The set of all right multiples az of an element a (I-modulus) is denoted 
by aS; 
[4.1.2] The solutions of az = 0 form a “JI~modulus” denoted by (a\0); 
[4.1.3] The modulus generated by the products ab where a and b are respec- 
tively taken from the moduli A and B is denoted by AB. 
[4.2] An alternative ring is called hypercomplez if it satisfies the following three 
chain conditions: 
(4.2.1) every (necessarily descending) chain a'S is finite; 
(4.2.2) every ascending chain of JI-moduli (a'\0) is finite; 
(4.2.3) every descending chain of I7-moduli (a;\0) is finite. 
For such rings the following theorems were derived:™ 
(4.3) There exists (at least) one finite set e,--- e, of idempotents with ee, = 0 
whenever i and k are different, such that the ring S is the direct sum 


S= (i,k =0---n) >> Sa 


of moduli Sx , the modulus Si, consisting exactly of the elements x which 
satisfy 
Cin = Sj Lix , Lines = OnjLin 5 


the “component” of x in the modulus Sis denoted by x4 and fori #~#0#k 
is given by ee, ; this ‘“Peirce-decomposition” has the following properties 
(4.3.1) and (4.3.2). 

(4.3.1) The moduli So: and Si, fori = 0---n, consist of properly nilpotent 


elements; the modulus Soo ts a subring. 
(4.3.2) The moduli S::, fort = 1.-- n, are hypercomplex and primitive subrings; 
their respective unit elements are the idempotents e; . 

The last two properties express the fact that the decomposition is a maximal 
one. Without loss of generality’® we assume that there are actually idempotent 
elements, and that consequently the number n is not 0. Moreover it can be 
shown by a refinement of the construction in (1) that an additional condition 
can be satisfied, to wit 
(4.4) If eis an idempotent element, then the Peirce decomposition may be chosen 

in such a way that e is a sum of elements e; , or 


e=((=1.-.m) De. 


The next theorem deals with the products of the moduli S;,. Like the re- 
maining statements from this section it is valid for general Peirce decom- 
positions. 


(4.5.1) SixSir © Sir, 
(4.5.2) SiS C Sis, 


en OSL Te Nyon oy 
| 4 Under the seshshetiiees dial a + a = O imply a = 0; we have removed this restriction 
In the second section. 

* See (3.13) and (3.2) 
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‘ (4.5.3) All other products SizSim are zero. 
The second of these statements is amplified by the formula: 


(4.6.1)  caya t Yate = 0 tf ik; 


this fact is going to be used in the weaker form: 
(4.6.2) If Nix is a modulus contained in Six, , and if i # k, then we can say that 
Ni Six = SuNix . 
Concerning products of three factors only a part of the “Dreierregeln’”’ js 
necessary for our purposes; which we report in the next theorem: 
(4.7.1)" With the exception. of products like (xiyii)eiu every product of type 
(rinyer2u 18 associative, that is, equal to rx(yeizi). 
(4.7.2) A product of type (xinyix)zix, where the indices i and k are different 
retains its value if the factors undergo a cyclic permutation. 
We name finally the rules about powers of the form (a,r)” and (ajxx)”: 


(4.8.1) (a,x)”" = (assxss)” “(9 = 0.--n) Ws Aiki; 5 
and fori + k 
(4.8.2) (ainx)” = (anaes) “(7 =(0... n) BY Aikte; + (aintin) (Qintes)” 


If we compare, in the latter formula, the components in S;; of the two members, 

we see that 

(4.8.3) e:(anx)"e: = (antei)”, and this holds likewise if the two indices are equal. 
The formulas (4.8) will only be used in the last section. 


5. With the aid of theorems developed in 4 and reported about in 5 we shall 
now, for a fixed but arbitrary Peirce decomposition with the properties enu- 
merated in section 4, construct an ideal which later on will turn out to be the 





16 See (1) p. 134. 

17 See (1) p. 136, last paragraph of sec. 3. 

In order to obtain the theorem (4.7.1) from (1) it is necessary to distinguish various 
cases ; the following direct proof seems more satisfactory. We see that if two of the ordered 
index pairs 7j, jk, ki are equal, then the three indices are equal; for due to the cyclic sym- 
metry of the situation we may assume in such a case that the first two pairs are identical, 
and that would meani = j andj =k. In those cases which are not classed as exceptions by 
the wording of the theorem we have to deal with three different pairs of indices, and we may 
consider the elements in question as components of a single element, namely, the sum 
rij + yjx + zee = a. But the components of one element a with respect to a fixed Peirce 
decomposition are contained in the subring generated by a and the idempotents ¢ . 

A single element generates an A-ring (see (1) p. 127); and so does, by virtue of the 
O-lemma, and the A-lemma, any set of orthogonal elements. According to the generalized 
theorem of Artin two A-rings generate an associative ring, and thus the components of a 
single number are seen to be associative with each other. In particular the three numbers 
Liz» Yik , ANd Zp are associative, q.e.d. 

18 See in (1) on p. 135 the more general formula (23). 

19 See (1) p. 136. 





ers, 


wal, 


nal 
nu- 
the 


the 


fa 
ers 








ALTERNATIVE RINGS AND RELATED QUESTIONS 683 


radical of the ring S. This ideal N will consist exactly of the quantities whose 
components are “singular” according to the following definition. 

[5.1] An element ai (of Sx) is called singular if all elements of aiS,; are nil- 
potent, otherwise regular.” 

The following statements are easily deduced from the theorems which are 
indicated thereafter. 

(5.2.1) If one of the indices ¢ or k is zero then every element ay is singular. See 
(4.3.1). 

(5.2.2) For equal indices the elements a;; constitute the radical of the ring Si: . 
See (4.3.2) and (3.13). 

(5.2.3) If an is singular, all elements of Syiaix are likewise nilpotent. See the 
remark about [3.4]. 

We define: 

[5.3] The set of all singular elements in S, shall be denoted by Nix. 

We see that the sets Nip and No; are identical with Sj and Sp; respectively 

and are therefore moduli; likewise—according to (5.2.2)—the sets Ni; are the 
radicals of S;; and are therefore moduli. We maintain 
(5.4) The sets Ny are moduli in any case. 
Suppose that both elements ay, and bj are singular, and consider the difference 
ax — bx. Every multiple (ax — bx)rii is the difference of two numbers, 
namely, ai%%; and b,2,;, from the radical of S;; ; such differences are neces- 
sarily in the radical N;; . 

The definition of the sets Nix implies that NixnS.; C Nix, and from (5.2.3) 
we infer also that Si,.NVi; GC Ni;. Again we make a statement which is slightly 
more general: 

(5.5) The products NiSix, SiN ix , NiiSki and Sei ii are contained in the 
modulus N tk « 

For 7 equal to & nothing new is asserted; for the other cases it is necessary and 

sufficient to verify the following inclusion relations: 


(5.5 1) (N:Sa)Sux C Nes, 
(5.5.2) (SiN Sir C Nii , 
(5.5.3) (NeiSei) Set C Nix , 
(5.5.4) (SiN xs) Ss C Nis - 


Evidently the last of these formulas is equivalent to the third, on account of 
(4.6.2). According to (4.7.2) the third relation is equivalent to (SxiSii)Nes C 
Nix, and this would follow from the second formula because we know that 
Six C S,;. The second inclusion in turn would be a consequence of the first. 
For a product of this type is associative,” therefore instead of (5.5.2) the rela- 





*° Compare (2) p. 257. 
*1 See (4.7.1). 
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tion of Si(N je Sri) CN ai May be discussed ; and if indeed N jk Ski is contained 
in NV ;; , as is implied by the first formula in this group, then (5.5.2) must be true. 
The first formula, on which the others are thus seen to depend, follows from 
SixSzi © Sj; if we remember that according to theorem (4.7.1) the product 
(N i;S ik) Sei is associative. 
We define now the set N which we claim to be the radical of the ring. 
[5.6] The set of all elements whose components are singular with respect to the 
Peirce decomposition under discussion is denoted by N; in other terms 
N = (i,k =0---n) Ne. 
As announced in the beginning of this section, we have the theorem: 
(5.7) The set N is an ideal. 
First of all we see that by virtue of (5.4) the set N is a modulus; it remains 
to show that the products NS and SN are contained in N. In accordance with 
the rules (4.5) we have 


NS = (0) Nix)(X Sim) = (i, 9, = 0++- 0) Dd) NiSa + (6 ¥ k) YD Naa; 


the relations (5.5) have been derived in order to make evident that the last 

member is contained in N; the other relation SN C N is verified in the same 

fashion. : 

We note that the ideal N does not contain any one of the elements e; (these 
are regular) and is therefore different from the ring S. This would allow us to 
state another special instance of the final theorem, namely: 

(5.8) If a hypercomplex ring is simple and does not consist exclusively of nil- 
potent elements, then the only properly nilpotent element is zero. A fortiori, 
the 0-ideal is the radical of the ring. 

This partial result is of value in the theory of finite alternative division algebras.” 


6. By means of the following lemma we show that every properly nilpotent 
element is contained in the ideal N. 
(6.1) If one component, say x;; , of an element x is not singular, then we can find 

a left multiple ax = ayx which is not nilpotent. 

Suppose that there is an element ay, such that a;42,; is equal to e; ; the existence 
of such a quantity will be shown presently. We say that a2 cannot be nil- 
potent, since the formula (4.8.3) makes it evident that the component in Sj; of 
this product and any of its powers is equal to e;. In order to prove that the 
element aj, exists we note first that none of the indices is zero, since Zi Was 
regular. If the indices are equal, then 2;; has an inverse in the ring Si: which 
solves the equation. If finally 7 and k are different, we choose first an element 
yix Such that yiut,; is not nilpotent; the latter product has again an inverse 2ii 
in the ring S;;, so that 2::(yuze:) = e;. Since the last product is associative, 
by virtue of (4.7.1), we see that au = zy is the element we wanted. 





22 See (5) p. 402. 
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In the other direction we prove: 

(6.2) Every element of a modulus Ny, ts properly nilpotent. 

Indeed, according to the formulas (4.8.1) and (4.8.2) the power (ayr)” is a 
sum of at most two terms which contain the factor (a;<:)" ‘; the latter quantity 
vanishes for sufficiently high exponents, and we see that every multiple of ax 
is nilpotent. This implies in particular that the elements of the ideal N are at 
least sums of nilpotent elements, and we state less if we say: 

(6.2.1) The ideal N is contained in every ideal which contains all properly nil- 
potent elements. 

Combining this with theorem (6.1), which guarantees that every properly nil- 

potent element is necessarily a member of N, we obtain: 

(6.2.2) The ideal N is exactly the ideal which is generated by the properly nilpotent 
elements of the ring. 

This remark makes it clear that the ideal does not depend on the particular 
Peirce decomposition which has been used in the construction. We have men- 
tioned before that the idempotents e; are not elements of the ideal, since their 
components in S;; are not singular; the same holds naturally for any sum of the 
form (i = 1--- m) }_ e,;, provided that it is not zero. By virtue of (4.4) we 
may choose a Peirce decomposition for which a preassigned idempotent e turns 
out to be a sum of this type, and that implies: 

(6.3) The ideal N has no idempotent elements. 

If an ideal does not contain any idempotents then all its elements are nil- 
potent; for an element which is not nilpotent has an idempotent multiple; and 
an ideal contains the multiples of its elements. It is obvious, for the same 
reason, that all multiples of any element in the ideal are nilpotent. Thus we 
have finally arrived at the conclusion: 

(6.4) The ideal N, like any ideal of a hypercomplex ring without idempotents, 
consists entirely of properly nilpotent elemenis. 

If we combine the theorems (3.13), (6.1), (6.4) and the definitions [3.4] and [3.10] 

we may finally state the 

(6.5) THzorem. Every hypercomplex alternative ring has a radical. 


Conclusion. If one is willing to restrict the class of hypercomplex rings the 
theory becomes much simpler. Mr. Artin has proved the existence of the 
radical for finite algebras of characteristic zero by extending the theory of traces.” 
By the use of an entirely different method (developed™ first for Lie rings) I 
have been able to establish the nilpotency of the radical for alternative systems 
which satisfy the ascending chain condition for subsystems; this takes care of 
arbitrary finite algebras and likewise of rings which have a finite number of ele- 
ments. Whether this fact is also a consequence of our three chain conditions 
I do not know. 


* Not published; see (5) p. 402, 
“See (6). 
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On the other hand it seems very probable that the three chain conditions are 
not the weakest assumptions which furnish a radical. Several intermediate re- 
sults can be established under conditions which are compatible with infinite 
Peirce decompositions; and it seems that there is a less restrictive theory where 
infinite matrices with a finite number of nonvanishing rows play a role. 
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SEPARATION SPACES’ 


By A. D. Wa.iace 
(Received July 17, 1940) 


1. Introduction 


The purpose of this paper is to give a system of axioms suitable for the con- 
struction of a theory of connectivity in the large and for the study of connectiv- 
ity invariants. The postulate system, which is of an essentially new type, is 
based on the notion of separation as the undefined concept. In sections 2 and 
3 we correlate our system with the usual topological concepts, in section 4 
proofs are given for the “expected” theorems, and sections 5 and 6 are devoted 
to some general examples. Section 7 is devoted to transformations and a 
generalization of the Eilenberg-Whyburn theorem on the factorization of con- 
tinuous transformations. 






















2. Axioms and Definitions 


We envisage an abstract set of elements S together with a binary relation 
X|Y between subsets X and Y of S. By a weak separation-space (or an 
Sy-Space) we mean a system (8S, |) in which the following axioms hold: 

Axiom §.I. The null set is separated from every set: (X C 8S) > (| X).’ 
Axiom 8.II. Separation is symmetric: (X | Y) — (Y | X). 

Axiom S.III. Separation is disjunctive: (X | Y) — (X.Y = 0). 

Axtom S.IV. Separation is hereditary: (X | Y).(X: C X) — (Xi | Y). 

For convenience we adopt the conventions: 

(i) The symbol “X | Y” is to be read “‘X is separated from Y” or “X and Y are 
separ Bs 

(i) In a topological space two sets X and Y will be said to be “topologically 
separated” if they satisfy the condition of Lennes-Hausdorff, X.Y + X-Y = 0. 

Before examining particular instances of s,,-spaces it will be necessary to have 
the 
(2.1) Derinrrion. If X is a subset of an 8,-space then kX is the set of all points 
no one of which is separated from X: (p ekX) = (p| xX)’. By H(X) ts meant 
the class of all sets Y such that Y = kY > X, and hX is the set of points common 
to all the sets in the class H(X). 

The topology arising from the adoption of kX (respectively hX) as the closure 
of the set X will be called the k-topology (respectively h-topology) of the s,-space. 


’ Presented to the Am. Math. Soc., April 8, 1939 and April 26, 1940. 

‘We adopt the logical notation of Kuratowski [1]. Thus if A and B are propositions 
thnA = B,A+B,A+ B, A-B, A’ are respectively A is equivalent to B, A implies B, A 
or B, A and B, and A is false. The symbol (X | Y)’ will be constantly used. 
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(2.2) Exampir. Let S be any set of elements and let “X | Y” mean that XY 
and Y are disjoint. Here the system (S, |) is an s,-space in which X = 4X = 
hX so that the k-topology and the h-topology are each discrete. 

(2.3) Exampte. Let S be a topological space in which closure satisfies the 
axioms K.J-K.III of (3.3), i.e., S is a T;-space. Define “X | Y” to mean that 
X and Y are topologically separated, X.Y + X-Y = 0. It is clear that the 
system (S, |) is a weak separation-space with X = kX = hX. 

(2.4) Exampie. In this example S is Euclidean n-space and “X | Y” is to 
~ mean the existence of an (n — 1)-dimensional subspace L such that X and Y 
lie in different open half-spaces determined by L. Here we have X ChX = kX, 
and this latter set is the closed convex hull of X. In this connection see $4. 
(2.5) TuroreM. In an sy-space we have 

K.I, If X C Y then kX CRY. 

KI], X CkX, kO0 = 0. 

H.I, If X C Y then hX CAY. 

H.Il, X ChX, hO=0. 

H.IIl, AhX = hX. 

HK X CkX ChX = hkX = khX. 

Proor: (K.I,). If 2¢kX then (x! X)’ and hence by S.IV it follows that 
(x | Y)’ so that x ekY. 

(K.II.,). For the first part, if 2 «eX then («| X)’ by S.III so that cekX. If 
x ekOQ then (x | 0)’ and hence (0 | x)’ by S.II and this is a contradiction to S.I. 
(H.I,). Using the notation of (2.1) we have to show that if X C Y 
then H(Y) C H(X). If Re H(Y) then X C Y CR = KR. From this it 
follows that R « H(X). 

(H.II,,). For the first part it suffices to show that kX C hX by virtue of 
K.II,. But if zekX it follows that («| R)' for any ReH(X), so that 
xekR = R. Thus z is contained in every element of H(X) and thus z €hX. 
For the second part, since k0 = 0 we have 0 e H(0) and hence h0 = 0. 

Before continuing with the proof of (2.5) we need the 
(2.6) Dermuition. If X is a subset of an sy-space and X = kX then X is said 
to be k-closed. 

(2.7) Turorem. The common part of any aggregate of k-closed sets is k-closed. 

Proor: Let X be the product of the family of sets [Xa] where Xa = kXa. 
By K.I, we have kX C kX, = Xq so that kX CX. In consequence of K.II. 
we have X = kX. 

Returning to the proof of (2.5): (H.III,). We need only show that hhX ChX. 
By definition, hX is k-closed, i.e., we have khX = hX. But hX ChX = khX 
and this qualifies hX for admission to the class H(hX), and so hhX C AX. 
(HK). We have proved thus far that 


X CkX ChX = khX. 


From the first inclusion and H.1,, we get hX C hkX and we know that 
hkX C& hhX = hX in consequence of K.II,, and H.I,. From this we have 
hX = hkX and thus the proof of (2.5) is complete. 
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If the operator h (or k) is to be an analog of closure it should be additive. 
This requires that separation be additive. An s,-space satisfying 
Axtom 8.V. Separation is additive: (X, | Y)-(X2 | Y) — ((Xi + X:) | Y), 
will be called a separation space or an s-space. 

Examples (2.2) and (2.3) are s-spaces while Example (2.4) is not. For if S is 
the Euclidean plane and X,, Y and Xz are respectively the points with co- 
ordinates (—1, 0), (0, 0) and (1, 0) it is clear that S.V does not hold. 

(2.8) THeoreM. In an s-space K.I, and H.1, can be replaced by the stronger 
properties 

KI k(X + Y) = kX + kY 

HI A(X + Y) = AX + hY. 

Proor: It follows readily from K.I, and H.I,, that we have kX + kY C 
K(X + Y) and AX + AY ChA(X 4+ Y). If xek(X + Y) this means that 
(x |(X + Y))’ and hence by S8.II it follows that ((X + Y) |<)’ and so by S.V 
either (X | x)’ or (Y|z)’. From this, using §.I, it is manifest that z ekX + kY, 
proving K.I. To prove H.I we assume that z «h(X + Y) but is not a point of 
hX + hY. Then x is contained in every R in the class H(X + Y) but there 
exist sets R, and Re, respectively elements of H(X) and H(Y), which do not 
contain z. Now by K.I we have 


Ri + Re = kh, + kh, = k(Ri + x) DX + Y 


and z is not a point of Ri + R,. This a contradiction since (R; + R:z) « 
H(X + Y). 


3. Topological Characterization of S-Spaces 


We begin by stating 
Axtom S.VI. If x|X then x| kX. 

Axtom 8.VII. If x and y are distinct points then x | y. 

An s-space satisfying the above two axioms will be called an s,-space. Ex- 
amples (2.2) and (2.3) are s,-spaces. While axioms 8.VI and 8.VII hold in 
Example (2.4) it is not an s,-space since it is not an s-space. The effect of these 
new postulates is shown by the 
hoa THEorEM. In an s-space the conditions K.I-K.III of (3.3) hold and 

= hX. 

Proor: It is clear that S.VII and (2.5) imply K.II. In consequence of K.II, 
and K.I of (2.5) and (2.8) we need only show that kkX C kX. But this is 
immediate from $.VI. Further K.III implies kX «H(X) and so hX C kX. 
From HK of (2.5) we get equality here. 

The fact that K.I-K.III of (3.3) hold in an s,-space does not imply that 
X| Y is equivalent to X-kY + Y-kX = 0. 

(3.2) E:ampte. Let S be the unit interval with its usual topology and define 
“(X|Y) = (X¥.¥ = 0)”. The system (S, |) is an s;-space with X = hX =kX. 
If P and Q denote respectively the sets 0 < z < 1/2 and 1/2 < z S 1 then 
(P| Q)’ but they satisfy the condition of Lennes-Hausdorff since they are top- 
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ologically separated. It is interesting to remark that S has no s-cut-points and 
indeed any subset FR of S which is dense in S is s-connected (ef. §4). 

This example indicates the need for the following 
Axiom 8.VIII. If for each x«X and each ye Y we have (x|Y) and (y|X) 
then (X | Y). 

An s;,-space satisfying S.VIII will be called an s,-space. 

(3.3) CHARACTERIZATION THEOREM. In an s,-space we have 
KI kX + Y) = kX + kY. 

K.I] kX = X af X is null or a point. 

K.IIT kkX = kX. 

RI (x ekX) = (x| X)’. 

RIT (X|Y) = (X-kY + Y-kX = 0). 

Conversely, if the operator k satisfies K.I-K.III and separation is defined by 
R.II then S.I-S.VIII and R.I hold. 

Proor: It remains only to show, in consequence of (3.1) and the definitions, 
that R.II is valid. Suppose that the left member of R.ITI holds. If X-kY ¥ 0 
(say) then there would be a point in X not separated from Y and hence (X | Y)’. 
If the right member holds and the left does not, then by S.VIII we can find a 
point ze X such that (x| Y)’. That is, X-kKY # 0, a contradiction. It is 
readily seen that the converse part of the theorem is valid. 

It is to be noted that there exist spaces in which closure satisfies K.I, K.II, 
of (2.5), and K.III and in which separation (as defined by R.II) does not lead 
to R.I. This is simply because in such a space a point need not be a closed 
set, i.e., kx # x. Nevertheless we have 
(3.4) TuHrorem. In an s-space, for any point x and any set X 


(a | X) = (X-ka + a-kX = 0). 





Proor: The right member manifestly implies the left. Assume that the right 
member is false and first suppose that x-kX ~ 0. Then by definition (z | X)’ 
and hence the left member fails to hold. Suppose only that X-ka # 0. Then 
there is a p e X which is not separated from z. Consequently we have (z | p)’ 
and hence z ekp C kX in virtue of K.I,, of (2.5). Again the left member fails 
to hold. 

Topological spaces of the type (3.3) were studied extensively by Kuratowski 
[1] and others and are called 7;-spaces by Alexandroff-Hopf [1]. They may be 
described in terms of closed sets by saying that (i) the null set, the space and 
each point is closed (ii) the sum of two and the common part of any number of 
closed sets is again a closed set. 


abate tt. 
. ripe a 


4. S-Connected Sets 


In this section we assume that S is an s-space (Axioms S.I-S.V). Our gare 
is to demonstrate that the “expected” theorems hold concerning “connected 
sets. 


Wie si. baa 
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(4.1) DEFINITION. A subset of an s-space is said to be s-connected provided that 
it is not the sum of two non-null separated sets. 

(4.2) Turorem. If X is s-connected and contained in Y + Z, Y |Z, then 
X¥CYoX CZ. 

Proor: If X intersects both Y and Z then X = X-Y + X-Z and neither 
summand is null. By S8.II, 8.IV, and S.V it follows that X.Y | X.Z and this 
is a contradiction to the definition of s-connectivity. 

(4.3) THuorem. If [X a] 18 a class of s-connected sets no one of which is separated 
from the s-connected set X, then the set Xo = X + }! Xq is s-connected. 

Proor: If not, then Xo = Y + Z and Y | Z, neither being null. Then using 
(4.2) we may suppose that X C Y. Since Z-X. ~ 0 for some a we have (again 
by (4.2)) that some X, is contained in Z. By §.II, S.IV and 8.V this implies 
that X | X., a contradiction. 

(4.4) THroremM. If X ts a non-null s-connected subset of the set Y then there is 
a maximal s-connected subset Xo of Y which contains X. The set Xo is called the 
s-component of Y which contains X. 

Proor: It is readily verified that the set Xo = X + [all s-connected subsets of 
Y which are not separated from X] is an s-connected subset of Y containing X 
and is not a proper subset of any other s-connected subset of Y having these 
properties. Hence Xo is the desired set. 

(4.5) Turorem. If Y is an s-connected subset of the s-connected set X then 
(i) IfX-Y=P+Q,P|Q, then Y + Pand Y + Qare s-connected; 
(ii) If Z 1s an s-component of X — Y then X — Z is s-connected. 

Proor: (i). If we assume that Y + P= M + N where M | N and neither 
summand is null it follows by (4.2) that we may assume that Y is contained in 
N and hence by S.III that M is a subset of P. Hence, since, 


X¥=(X-Y)+YVY=P+Q+Y=M+(N+Q, 


on applying S.IV and 8.V it follows that M | (N + Q), a contradiction to the 
assumption that X is s-connected. 
(ii) Assume that X — Z =M + N where M!N. Since Y does not intersect 
Zand Y is s-connected we may assume that Y is a subset of N. By (i) the set 
M + Zis an s-connected subset of X — Y and since Z is an s-component of this 
set and M-Z = 0 we have M = 0. 

A somewhat different form of the first part of (4.5) is as follows: 
(46) Turorem. If P and P.Q are s-connected and R|(P — (Q + R)) then 
P.(Q + R) is s-connected. 
(4.7) Tazorem. If X + Y and X-Y are s-connected and (X — Y)|(Y¥Y — X) 
then X and Y are s-connected. 

Proor: We have 


X+¥)-X-Y=(X—¥)+(¥ —X) and X=X-Y+ (X—-¥) 


80 that we may apply (4.5). 
In (4.7) the condition (X — Y)|(Y — X) replaces the classical (and un- 
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necessarily restrictive) ““X and Y are closed in X + Y” which indeed ig not 
applicable in an s-space. To show the efficacy of the condition it may be 
shown, for example, that 7m order that a connected, locally connected metric space 
R be unicoherent it is necessary and sufficient that for any decomposition R = P + Q 
where P and Q are connected and (P — Q) | (Q — P) (topologically) we have P.Q 
connected. 
(4.8) Turorem. If X is s-connected and X C Xo © kX then Xp is s-connected, 
Proor. The set ¥; is of the form X + }) X. where each X, is a point of 
kX — X. We then apply (4.3). 
(4.9) THrorem. If X is s-connected so is hX. 
Proor: From (4.8) we know that 4X is s-connected. If hkX = M+N 
where M | N then, because X is a subset of hX, we may assume that X is con- 
tained in M. Since M | N it follows that M-kN =0 = N-kKM. But 


M+N =hX = khX =k(M+N) =kM+kN 


and thus M = kM > X so that hX C M and therefore hX = M. Hence 
N = 0 and we see that AX is s-connected. 

If the set J is s-connected and contains the points a and b and no proper 
subset of J enjoys both of these properties then J is said to be irreducibly s-con- 
nected betweenaandb. It may be shown that ifxel,a ~ x #~bthen] —2z= 
I, + ie where ael,, beI2, I,| Je and both J, and J, are connected. This 
enables one to order the elements of IJ (from a to b) by defining p < q if there 
is an x eI giving the decomposition above with peJ, and geI2. We state 


without proof the’ 

TureoreM. If I is an s-space satisfying Axiom §8.VIII and I is irreducibly 
s-connected between a and b, then, if the elements of I are ordered as above, the 
Dedekind proposition holds. 


5. An Example 


Let S be a space satisfying the first three axioms of Kuratowski (i.e., closure 
satisfies K.I-K.III of (3.3)) and let G = [Z] be a family of closed sets of S. 
Throughout this section we define separation as follows: The sets X and ¥ are 
separated (i.e., X | Y) provided that one of them is null or there exists an element 
Z of G such that 


(+) S-Z=U+V, XcCU, YCcv,  U.V+U-V=0. 


(5.1) THrorem. The system (S, |) is an 8,-space and for any subset P of S the 
sets kP and hP are closed and P < kP €& hP. 

Proor: It is manifest that the (5S, |) is an s,-space. _We-proceed to show that 
kP is closed. To this end it suffices to prove kP is the common part of a collec- 
tion of sets H = [U + Z] satisfying (+). We define this collection as follows: 
For any x not in kP we know that (+) holds with P = X and « = Y and we 





2 See Knaster and Kuratowski [1]. 
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let H*be the collection of all sets U + Z thus obtained. If we let Q be the 
product of all the sets in H then kP CQ. The reverse inclusion is easily seen 
to be valid. 

If the collection G contains only the null set then for each X in S the set kX 
is the sum of all the quasi-components of S which intersect X. If G contains 
all closed sets and if further S has the property that for any two topologically 
separated subsets of S the condition (+) holds, then the system (S, |) is an 
srspace and X,= kX = hX. Further, condition R.II of (3.3) is valid so that 
separation and topological separation are the same. Thus varying the collec- 
tion G enables us to get anything from an s,-space to the full topology of S. 

For the remainder of this section we suppose that the family G is a multiplica- 
tive ideal in the lattice of closed subsets of S, that is, if Z, and Z, are in G then so is 
Z,+ Z,, and if Zp is a closed subset of an element of G then Zp e G. 

(5.2) Lemma. If fort = 1, 2,--- n we have 


(+); S-—Z,=U;+ Vi, X;CU;, Y,;CYV,, 
U;-V; + U;-V; = 0, Z; €G, 
then there is a ZeG@ such that (+) holds with X = X,-X_--- -X, and 
Y=Y,+ Yot+---+ Yn. 

Proor: Since the general case follows by induction we restrict our attention to 
thecasen = 2. Let U = U,-Uzand V = Vi + V2, andsetZ = S— (U+ DV). 


That U.V + U-V = 0 is immediate and clearly X C Uand Y CV. Further 
we have 


Z = (S — U)-(S — V) = [((S — Ui) + (S — U2)]-[S — V] 
= [(Z, + Vi) + (Ze + Ve2)]-[S — Vi]-[S — Vo] 
= (Z, — Ve) + (Ze — Vi). 

























Since V; and V2 are open it follows (since G is an ideal) that Z ¢ G. 
(5.3) TuHzorem. The system (S, |) is an s-space. 

Proor: In virtue of (3.1) we need only prove that 8.V holds. But if X | Yi 
and X | Y; then (+); holds fori = 1,2 and X = X,; = Xz. Consequently (+) 
holds with Y= Y; + Y2 . 

The condition that G be an ideal is essential here, and even in this event it 
‘ porate to give examples of Peano spaces in which 8.VI-S.VIII all fail to 

ol 
(5.4) Lemma. If P is closed, S is compact metric, and for each p ¢P we have 
P| Q, then P |Q. 

Proor. For each p ¢ P condition (+) holds with X = Q and p = Y. Since 
B is compact and each V is open we can find an integer n such that (+), holds 
for¢ = 1,2 ---n, and P CV, + V2+---+ Vn. Thus (+) holds with 
X = Qand Y = P and this completes the proof. 
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(5.5) Turorem. If S is compact metric and P is s-connected then kP and hP 
are connected. 

Proor: If kP is not connected then we have kP = M + N where M and 
N are disjoint closed sets, neither of which is void. There exists a closed 
subset F such that 


S—-F=A+B, MCA, NCB, A.B+A.B=0, 


Since F-kP = 0 we have f | P for each f e F and thus by (5.4) we conclude that 
F|P. Thus (+) is valid with X = Pand Y= F. SinceP CA + Bwemay 
suppose that P-A ~ 0, and thus P.(U — B) = 0 in virtue of the fact that 
A-B =0. Further since B — B CF CV we haveU —-B=U-B. Itis 
clear that the sets U — Band V + B are topologically separated being disjoint 
and open, and moreover S — Z = (U — B) + (V + B). But since P is s-con- 
nected and P-(U — B) is not null we have P CU — B. Further N.(V + B) 
is not null and so for some ge N C kP we have q| P, a contradiction. The 
proof that hP is connected is as follows: By (4.9) AP is s-connected and by 
(2.5) khP = hP so that by what we have just shown the set hP is connected. 

The following example has many interesting features. 

(5.6) Exampie. Let S be the unit interval with its usual topology, let s be 
the point 0, and let G consist of all closed subsets of the sequence (1, 1/2, 
1/3, ---,1/n,--- 0). Notice that s = hs = ks but that if X is a closed subset 
which does not contain the point s then kX ~ s while LX = s. Here the only 
k-closed (h-closed) sets are the null set, the point s and S. 

While the systems studied in this section have long been known in topology, 
going back to Poincaré (cf. Menger [1, 2], Moore [1]), I am not aware that any 
effort has been made to introduce a topology of separation, or to superimpose a 
topology of any sort on the system. The following proposition has a certain 
intrinsic interest. 

(5.7) THrorem. The relationship of “not being separated” is a continuous 
function on the space of all closed sets of S in the sense that if X, ~ X and Y,— Y, 
then (X,| Y,)’ implies (X | Y)’, S being assumed compact and metric. 


6. A Dynamical Example 

In the present section we borrow the terminology of dynamics (cf. Hedlund 
[1]). Roughly speaking two sets will be “separated” if they are not “joined” by 
a motion. More exactly we consider a metric space S and a family T = [#] of 
subsets of S (not necessarily closed). Further we suppose that through each 
point of S there passes at least one motion (= element of 7). A subset P of 
S will be invariant provided that P = >t, t-P # 0. 
(6.1) Dertition. The subsets X and Y of S will be separated (i.e., X | Y) 
provided that 
(i) Nu element of T intersects both X and Y. 
(ii) No element of T intersects both X and Y. 

If the elements of 7’ are points of S then separation and topological separation 
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are equivalent and the k-topology and topology of S agree. It may be seen that 
the following is valid. 

(6.2) Tuuorem. The system (S, |) is an s-space in which Axiom S.VIII holds. 
For any subset X of S we have 


kX = > t,t-X¥ #0. 


In order that a set be k-closed it is necessary and sufficient that it be closed and 
invariant. The set hX is the smallest closed and invariant set containing X. 

In general a set ha (x e S) will not be a minimal set (in the sense that it may 
contain a proper k-closed subset). It is readily seen that if the motions are 
connected sets then if P is s-connected sets KP and hP are also s-connected 
and connected. 


7. Transformations and the Eilenberg-Whyburn Theorem 


If A and B are s-spaces then the single-valued transformation 7(A) = B 
will be called s-continuous provided that 


(X| ¥) = (T(X) | T“(Y)) 


for each pair of subsets X, Y of B. 

This definition is justified by the 
(7.1) Turorem. If A and B are compact metric spaces and “separation” is 
interpreted to mean “‘topological separation” then T(A) = B is continuous if and 
only if it is s-continuous. 

Proor: If 7 is continuous it is readily seen that it is s-continuous. If 7 is 
not continuous we can find a sequence z, — z in A such that 7(z,) converges 
toa point y ~ T(x). It follows that we may suppose that the sets 7(x) and 
T(a;) + T(a2) + +--+ are separated. Hence the sets x and 2; + 22 + --- will 
be separated in contradiction to the assumption that z is an accumulation point 
of the set 2; + %2 +--+. 

The following proposition (while not immediately relevant to our purpose) 
has considerable intrinsic interest. 

(7.2) Turorem. Jf T(A) = B is continuous where A and B are compact metric 
then the following properties are equivalent (‘‘separation” meaning “topological 
separation”’): 

(i) The image of an open set is open. 

(ii) If X C A and Y CB then (X | T(Y)) = (7(X) | Y). 

Proor: Note first that (under our convention above) a set V is open if and 
only if p|(B — V) for each pe V. Hence if 7’ satisfies (ii) it is to be shown 
that if U is open then y e T(U) implies y|(B — T(U)). Now if re T(y)-U 
then x and A — U are separated and hence the sets x and A — 7” '7(U) = 
T'(B — T(U)) are separated (in virtue of S.IV). It then follows by (ii) that 
y|(B— T(U)). We suppose now that 7’ satisfies (i). To prove the leftward 
implication of (ii) we see that 


(X | TY) > (TT (X) | T"(Y)' > (TH) | FY 
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by S.IV and (7.1). To prove the rightward implication we note that if (ii) 


holds then T”-*(P) = T”*(P) for each P C B, and T(Q) = T(Q) for each Q CA. 
Hence 


(X | T'(Y)) > (X-T"'(Y) + X-T(¥) = 0) 
— (T(X)-Y + T(X)-¥ = 0) > (7(X)| ¥), 


We suppose from now on that A is an s-space and T(A) = B is single-valued 
and varying hypotheses will be placed on 7 and B. 
(7.3) If B is any abstract set of elements and we define for each pair X, Y in B 


(X | Y) = (T(X) | T“(Y) 


then the system (B, |) is an s-space and T is s-continuous 

Proor: To validate the second statement it remains only to prove the first. 
Since S.I-S.III are immediate we have to show that S.IV and 8.V hold. But 
these are immediate in virtue of the identities (P C Q) = (T'(P) C T™(Q)) 
and T(P + Q) = T'(P) + T'(Q), and the axioms. 

(7.4) Turorem. If A’ and B are s-spaces, T;(A) = A’ and T;(A’) = B are 
singled-valued, and T = TT and T; are s-continuous, then T>2 is s-continuous. 

Proor: If X and Y are subsets of B and X | Y, then we have 7”"(X) | T"(Y). 
But 771(X) = T7'T7'(X) and TY) = Tz'Tz'(Y) so that since 7; is s-con- 
tinuous we have T7'(X) | 77'(Y). The reverse implication follows similarly 
so that 72 is s-continuous. 

(7.5) Turorem. If T(A) = B is s-continuous (B an s-space) and T™"(b) is 
s-connected for each b « B, then T~'(Bo) is s-connected for each s-connected subset 
Bo of B. 

Proor: Suppose that 7~*(B)) = P + Q where P| Q and neither summand 
is void. Then for any b e By we have T*(b) contained in either P or Q. Thus 
P = T°'T(P) and Q = T'T(Q) so that we get T(P)|7(Q). Since B = 
T(P) + T(Q) it follows that By is not s-connected, a contradiction. 

The following is a generalization of a theorem due to Eilenberg [1] and Why- 
burn [1]. It is to be noted that no use is made of upper semi-continuous de- 
compositions in the proof of the theorem. 

(7.6) TuEorem. If A and B are s-spaces and T(A) = B is s-continuous then 
there exists an s-space A’ and s-continuous transformation T;(A) = A’ and 
T,(A’) = B such that T = TT; where 

(i) for each a’ ¢ A’ the set T;"(a’) is connected and 

(ii) for each b ¢ B the set Tz'(b) contains no s-connected set containing more than 
one point. 

Proor: Let A’ = [a’] be the class consisting of all s-components of all sets 
T‘(b), be B. For each ae A let T;(a) = a’ be the s-component of 7 ’T(a) 
which contains a. For X’, Y’ C A’ define X’| Y’ to mean that Ty (X’) | 
T;'(Y’). Then by (7.3) T; is s-continuous and A’ is an s-space. By (7.4) 
T, = TT is s-continuous. If X’ is an s-connected subset of 7'2'(b), > € B, 


SEPARATION SPACES 697 


then by (7.5) we know that 77'(X’) is s-connected and is contained in 
TT;'(b) = Tb). Thus Ty’(X’) is in a single component of 7~'(b) and thus 
X’ is a single point of A’. 
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AND COMPLEX GROUPS 
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INTRODUCTION 


The original purpose of this paper was to apply tensor methods to a study of 
the invariant theory of homogeneous contact transformations. L. P. Eisen- 
hart and M.S. Knebelman (1) have shown how a vector A of 2¢ components in 
the phase space of 2g coordinates x* and p; may be decomposed into two geo- 
metric objects \ and » of g components each. The transformation laws of \ 
and » were such that whereas the new components X of one set were linear 
combinations of the corresponding old components A, the new components j 
were linearly dependent on both sets \ and yw. Because of this transformation 
behavior the decomposition just cited will be described as partial. In this paper 
we shall arrive at a complete decomposition in the sense that the }’s will be 
defined in terms of the )’s only and the j’s in terms of the »’s. The underlying 
formalism of the Eisenhart-Knebelman paper has been employed here and the 
decomposition at which we arrive agrees, when restricted to extended point 
transformations, with their similarly restricted decomposition. 

Our decomposition has been accomplished by introducing the notion of two 
families of complementary subspaces of the phase space and of projection tensors 
in these subspaces. Since the method is independent of any implication of a 
contact transformation on the coordinates it will first be developed in its full 
generality and the decomposition under the contact group will be presented as a 
special application of the general theory. As additional applications we shall 
give two decompositions of tensors under the complex group, one being real and 
the other complex. In section 16 the doubly homogeneous contact group of 
Schouten (2) is derived independently and a complete decomposition for this 
group in terms of an affine connection is given. 


1. THE UNDERLYING Systems oF DIFFERENTIAL EQUATIONS 
Two systems of partial differential equations of the solved type 


(1.1) Xaf = L*.def = Iaf + L’.(x*) a,f = 0 
(1.2) Xif = L*, Gof = aif + L’(2*) Of = 0 
5", ~(L*; , 
for which | L“,| = OA ge © wind a4f = Ff and wherein the indices 
L's a; ox4 








have the ranges (a) = (1, --- , g), (i) = (q+ 1, --- , N), and (A) = (1, --+ ,) 
will be called complementary. 
698 
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Associated with the systems X;f = 0 and X af = 0 are their dual Pfaffian sys- 
tems P* = N°qdx° = dx* — L*, dz’ = 0 and P' = Ng dz° = dz‘ — L', dx” =0 
with the matrix || Ns || = y r ' From these definitions, if f(x) 

—Le i 
is a solution of X; f = 0 its differential df = 0,f(dx* — L*, dz’) will vanish for 
any increment dx* satisfying P* = 0. Conversely, if f(x) is an integral of 
p* = 0 then df = (0,f + L’,0,f) dz* must vanish identically in dz’ and f(z) is a 
solution of X;f = 0. 

We shall confine our attention to (1.1) in making a few general re. .arks. 
Assuming for the moment complete integrability, there exist N — q prir cipal 
solutions (3) g(x“) satisfying the initial conditions ¢‘(z% , x’) = zx’ within a 
suitably small neighborhood of a point (2*) for which L‘,(x*) are regular and 
the most general system of solutions is given by any N — q independent func- 
tions f(y’) of the principal solutions. Since 0;¢" |,«.,2 = 4'; it follows that the 
determinant | @;f‘ | does not vanish identically. A general system of solutions 
f(z) determines an ©”~* family of g-dimensicnal subspaces S*(c‘) as defined by 
the equations f(z) = c’.. A parametric representation may be obtained by 
solving these equations to give 2° = y‘(x*, c’), the complete parametric system 


for S%(c') being 
(1.3) | gaa 2 = y(2",c’). 
Now the functions F* = x‘ — y‘(x*, c’) when equated to zero represent the same 


family of subspaces S*(c*) as defined by f‘(zx) = c’ and hence must satisfy the 
system (1.1) for all values of 2* lying in S%(c'), that is, 


(1.4) daw’ = L‘,(x", y*(2", c’)). 


Suppose S“(c°) to be parameterized in a general manner by the equations 
@ 


r= 2*(u*). Then the q invariants \. = hos are the scalar products of A, 


A 
with the q tangent vectors = of S%(c‘). Under parameter transformations the 
\’s constitute a g-component vector of S*%(c') which will be called the projection 
of A, on S%(c'). A distinguishing feature of the parameterization (1.3) may be 
emphasized by writing it in the form 


(1.5) cay gf = yu", c’) 


and transforming the z’s by #4 = f*(x) so that 2* = f*(2’, ¥‘(2’, c’)). If we 
insist on invariance of form for the first set of (1.5), namely Z* = @*, it may 
be seen that a transformation 2“ — #* induces the parameter transformation 
ud given by a* = f*(u’, y'(u’, c’)). The condition for non-singularity is 


- mg oo - a ad ~ 0. Reverting to our first notation * = u“ and 
applying (1.4) this condition on the transformation 2* — #* takes the form 
\Xsf*| = | agf* + L's0,f*| ¥ 0 for points (x*) of S*(c’). 
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It has just been shown how every coordinate transformation z* — 7% gatis. 
fying | Xaf* | ¥ 0 induces an isomorphic non-singular parameter transformation 
u*—%a%*. Thus the coordinate scalars ha = Ag a will transform as parameter 
vectors under a coordinate transformation 2 — Z and because of the covariantive 
parameterization z* = u* these parameter vectors will have the semblance of 
g-component coordinate vectors. The components A, are Ae(z*) = Ae + AL’, 
where the 2’s are confined to a subspace S*(c°). But since any point (z*) is a 
point of some such subspace, the quantities \. constitute a g-component factor 
field of the N-component vector field A,. Analogous conclusions may be drawn 
from the complementary system (1.2) leading to the complete decomposition 
ha = AgLh®. = Ae + ApL’g and i = Agh®; = A; + A,L’; of Ax into ag and 
N — q component factor field respectively. The restriction | X,f‘| = 
| a,f° + L’,0,f' | ~ 0 on the coordinate transformation #* = f*(x) analogous 
to | X,f* | ¥ 0 is encountered. To determine the transformation character of 
the factor fields we shall investigate the behavior of the basic systems of partial 
differential equations under a change of coordinates. 


2. THe TRANSFORMATION CHARACTER OF THE Basic Systems 


Let the most general coordinate transformation z — Z be sought which will 
preserve the solved form of the basic systems. That is, we stipulate the general 
laws 


Xsf=TYX,f Xf = 7/X,f 
: p* = Py, P* = P’T;' 
wherein | 73*| # 0 and |7;'| # 0 so that Xsf = O will imply Xf = 


def + L':d,f = 0 etc. If both sides of these equations be written as linear 
forms in d,f and dz* and the coefficients be identified there will result 


L’,Xet = Xet° ss, X @" = X48" 


(2.1) 


(2.2) a a é é 
T3°X 4° = 8% T/X,z' = 8; 
(2.3) L'pX,0' = —dgc' + Li den” = LX ax* = —Sa* + 150" 
['5,2% = —L*,5gc" L’5,2° = —L',d;2°. 


The first line of (2.3) reduces to the inverse of the corresponding line in (2.2) 
when one applies the second line of (2.3). We impose the restrictions 


(Re.s) |Xet*|~¥O |X| 40 


identical with those of the preceding section on the transformations £* = #*(c) 
so that L’s and L*; will be determined uniquely by (2.2). The second set of 
(2.3) may be regarded as identities which obviously hold for the identical trans- 
formation #* = 2*. Transformations x — # satisfying (Re) will be called 
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admissible and henceforth only such transformations will be considered. Equa- 







































ion tions (2.2) are adequate to establish the chain formulas of differentiation 

ter (2.5) Xph* = XX” Xk = XX 2 

ive giving the conjugate relationships 

‘ (2.6) 5s = X pu" Xp" a; = Xi2'X Pe 

. for the transformation #4 = z*. Comparison with the second line of (2.2) 

‘or gives Ta” = Xex* and T;' = Xja° and from (2.5) the transitivity of the trans- 

wn formations L — L follows readily. The transformation equations (2.1) of the 

‘on basic systems are thus a: 

nd en Ref = XfXe” Xf = XSX a" rh 

. (2. P* = PPX x* P= PR‘. if 
§ 

of 3. THE Group = A =* or ADMISSIBLE TRANSFORMATIONS ‘i 4 


Although the transformations + — @ have been defined as changes of coordi- 
nates, it will be convenient in this section to regard them as point transforma- 
tions in order to facilitate the description of the transitive properties of certain 

“ill objects transforming under them. We shall now establish the group properties 
of the transformations of the set 2: Z* = o*(x) characterized by | Xs%*| ¥ 0. 
We first observe that the identical transformation #* = z* isin 2. Secondly, 
if we first transform from the point (x) to the point (Z) by a transformation of 2, 
thereby determining at (#) a unique set of functions L's(#), and then transform 
toa third point (Z) by any transformation %* = ¢“(%) of a set which we shall 
call 2, where the elements of = are characterized by the property that 
= | X,t*| 0, it is always possible to transform directly from (x) to (#) by an 
ar element of 2, that is, by a transformation #* = 2Z“(x) for which | Xsz*| ¥ 0; 
for it is merely necessary to form the determinant of both sides of (2.5) and 
observe that | Xs%*| is the product of the two non-zero determinants | X,Z* | 
and | X,¢*| and so must be likewise non-zero itself. It remains to show that 
ifz— Zis an element of = the inverse transformation x <— Z is an element of 2, 
that is, we must prove that if | Xs¢*| ~ 0 then | Xer*| ¥ 0. But this follows 
immediately from (2.6), 1 = | X,x*|-| X#°|. Since it has now been proved 
that the most general transformation # —> % with which we shall be concerned 
is compounded from an inverse transformation of the set = followed by a direct 


ral 


c= = amine . - es 4 
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wate Lage EW. as F 
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2. 
a 
- 2 


2) transformation of 2 we have the 
TarorEM 3.1: Corresponding to any choice of the functions L‘,(x) in some arbi- 
trary but definite coordinate system x, the subset = of the general group G of analytic 
transformations * = #“(x) for which | Xst* | + 0 when taken in conjunction with 
2) all the inverse transformations of = constitutes a subgroup of G. ‘ | 
of An analogous theorem holds for the subset 2* of G for which | X;2°| # 0. 
1S- It follows that the transformations of the intersection set 2 ~ 2* likewise form 
ed asubgroup of G. 
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4. FACTORIZATION OF ANY TENSOR 


The invariance of the independence condition | L*, | # 0 will now be estab- 
lished by proving that after an admissible transformation 2 — Z characterized 
Xe" 0 

0 a" 
be likewise non-vanishing. The transformation laws (2.7) for the double 
system X,f = 0 become L*:0ef = L*oAzt°S,f and the conditions that these 
hold for arbitrary f are L°,d9%* = L*gA,t°. On forming the determinant of 
both sides it is apparent that | Z*, | ¥ 0 is a consequence of | L*, | # 0, | dsi*| 
~ 0 and | Asz* | ¥ 0 as was to be proved. 

We shall call the matrix || LZ“; || the factor frame of S”, a name suggested by 
the definitions \, = Agl°, of the factor components dq. and A; corresponding 
to A,. Thus the left members Xzf of our underlying equations are factor 
components of the gradient d,f. In a new coordinate system Z the factor 
components of an arbitrary vector A, are defined by Xs = AgL*,. For the 
case A, = d4f we know from (2.7) that A» = AeAst*. To verify that these 
hold for an arbitrary vector A, we rewrite them as (L“oAs&° — L°sdo%*)A, = 0 
and note that the coefficients of A, vanish as a consequence of the definitions of 
A;é* and of the transformation laws (2.2) of the L’s. 

Up to this point we have considered only a factorization for a covariant vector 
A, with factor components \4 = AgL*, transforming according to the laws 
Xe = ApX ax’ and X; = A,Xiz’. Because of the condition | L*, | * 0 there exists 
a matrix || M*, || conjugate to || ZL“, || satisfying M“oL°s = 6*2. As a basis 
for defining the factor components \* of a contravariant vector A* we shall 
insist that A°Ag = A°Ag hold for A* and A, arbitrary. This leads tu the defini- 
tions \* = M*,A°. From the four sets of relations 


“e+ M*,L's = 5°, = =M*; + M*,L’; = 0 
M’;+M',L’;= 8; M+ ML’, =0 


resulting from M“oL°, = 8“, these frame components are 


by | ds%* | ¥ Oand | Azz* | = 








~ 0 the new determinant | L*;, | will 


(4.1) 


(4.2) ae on M*,(A° a L.A’) xi oii Mi ,( A? co L’, A’) 
where the coefficients M“s and M‘; are defined by the conjugate relations 
(4.3) M",(8's — L’,L's) = 8% M'(8'; — L'el")) = 85. 


It will now be shown how a factorization of an arbitrary covariant vector 
will induce a corresponding factorization for an arbitrary contravariant vector. 
For if we define factor components A“ of A“.in two coordinate systems x and 
# by \* = M*,A° and \* = M“K° the two sides of A4A, = A® Ag will be equal 
respectively to the two sides of XX, = A%q, or Ae + NAL = A*(AAX pF") + 
d"(A:X,') by virtue of our definitions of X44 and \,. By insisting that these 
be identities in \, we obtain the desired laws }* = \’X,2* and \' = NX. 
It involves but a slight extension of this demonstration to define factor com- 
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ponents \“» of a simple mixed tensor A“, by A*, = M“* A°L", and observe 
that they will transform by A“, = \’.X pt" Xr", X'; = NX,2'X 2", X%; = 
\’,X,é°X ja” and Xs = W’,X,@'Xsx°. The generalization to tensors of arbitrary 
covariant and contravariant rank is obvious. 

Let us now return to equations (4.3). Defining ¢“; and t'; by t% = 6% — 
LL’, and t'; = 6; — L’,L’; the equations (4.3) are M“,t’s = 6°; and M‘t'; = 
s',. Their form suggests that ¢%; , t'; , M“s , and M’; are factor tensors and such 
is in fact the case. To establish the tensor character of t*, it is sufficient to ob- 
serve that (L%,L",)Xet” = (L’,L‘s)X ,£* follows from (2.2) and (2.3). By use 
of the ¢’s the transformation laws (2.2) of the L’s may be written in the form 


(4.4) L',Xpe? = L'eX 4° + tp0,@° | L*,.X ja" = L’;X 8° + 0 0,8°. 


It is now apparent that corresponding to any contravariant vector A‘ there are 
the two sets of factor vectors 


A** = AX —L* A’ -d* = AS — L‘, A’ 
\* = M*,n*? i = Mr”. 


These two sets are identical when and only when the tensors L*,L's and L‘,L’; 


vanish. 
Referring to the definitions in section 1 of the coefficients N*», of the funda- 


mental Pfaffians, it may be seen that || N“oL°, || = be 


0 
0 t; 
|N4o|-|L°s| = | t%s|-|¢';|. But since | N*5| = | L*s|, as may be seen by 
multiplying the last N — q columns (index j) and the last N — q rows (index 7) 
of |N“,| by —1, we have | L*, |? = | ¢%;,|-| ¢';| and in new coordinates 2, 
| I‘, |? = |i |-|#;|. Because the ¢’s are simple mixed factor tensors, | 7%; | = 
| t%s| and |Z; | = | ¢*;|, amd hence | Zs |? = | L*,|’. From continuity con- 
siderations under an infinitesimal transformation we finally have | L‘,| = 
|L*,|. Hence from L°s@gt* = L*oAst° there follows the relation | ds%* | = 
| Xet* |-| X 2° |. 


and therefore 


5. COVARIANT DIFFERENTIATION AND THE COMMUTATOR SYMBOLS 


We now assume the existence in S” of an affine connection L“z¢(zx) in terms 
of which we define the covariant derivatives of arbitrary vectors by Ac,s = 
Oehc — AgL® zc and A*,5 = dsA* + A°L“sq. Similarly for the factor com- 
ponents of these vectors we define factor covariant derivatives by Ac,2 = 
Xwde — Ael*sc and A*,, = Xpr* + A°%* eq, where the l’s are functions to be 
called factor components of connection, undetermined as yet, such that these 
latter quantities Xc,, and *,s will be factor components of the former tensors 
Acs and A“*\, respectively; Ac.s = Ar,el*ch*s and A°,,L“g = A*oL*s. 
These conditions determine 


(5.1) SxcliAg = L* onl sL*¢ + Lsdqh*c 
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as the relations that must exist between I*s¢ and Lc. In a new coordinate- 
system @ we have components of affine connection given by L°s,dgr* = 
Lord pt°d a" + 5% cx* and these induce the transformation law 


(5.2) I? scAgt* = I4 opA pt Aca” + X pAcr* 


according to which 
VayX pu" = U*ppXgx?X yx” + XpXyx* Vj, X 2% = 1°, aX yx" + XX 0° 
Vp Xia’ = UX ja’ Xie + XX Vp Xz" = UX Xx + FRc! 
while 1‘;, , 1°; , l';, and 1%», transform as factor tensors. 
The skew-symmetric portion 2“ ¢ of the affine connection L*5¢ = T5¢ + 
04 gc is a tensor with factor components gc which may be computed from (5.1), 
(5.4) ‘(Los » tg Lakh Ae = 4([° ac sens os) b*6 = 4s 


where I4 5¢ = L* del ¢ — L* cdgL* 580 that 1%, = I‘, = 0. The commutator 
(XX c)f is then (X 2X c)f = T° sc®of => M?* I gch* Of aS M ackaf where 
Mac = M*oI°sc. The quantities \*s¢ will be called commutator symbols for 
the operator X and in terms of them we have by (5.4), Q* orl Lh" ¢ = (Isc — 
I? on) — A*eclL*e, and hence the factor components w*sc of Qc are 


(5.5) w“se = (Is ac — I cn) — Wacl. 


(5.3) 


6. INTEGRABILITY CONDITIONS AND NORMAL COORDINATES 


The system Xgf = 0 will be completely integrable if and only if the com- 
mutators (XX y)f = N"s~X pf + d’syX,f are linear combinations of Xaf. Hence 
d's, = M'I's, = 0 are ‘the conditions for integrability. Because of the tensor 
character of M‘; and the circumstance that | M‘;| ~ 0 we may write I oy = 0 
in place of X's, = 0 and for this reason we shall call J‘s, an integrability tensor. 
When I's, = 0 there then exist N — q independent solutions f(z) which repre- 
sent «” * subspaces S‘(c’) of q dimensions when equated to arbitrary con- 
stants c’. 

Let us now consider the more general nunstnobeienials case wherein I';, # 0 
and therefore Xsf = 0 are not completely integrable. In this instance it be- 
comes more convenient to discuss the Pfaffian system dz’ — L', dx’ = 0 cor- 
responding to Xgf = 0 rather than these latter equations themselves. These 
equations determine an elemental g-spread of directions dz* at points of S” and 
in me holonomic case the totality of these elemental g-spreads envelops the 

* subspaces S*(c'). For the sake of generality this non-holonomic view- 

i will be retained henceforth and the holonomic theory will become the 

special case characterized by I's, = 0. Similar remarks apply to the Pfaffian 

system dx“ — L*,dzx’ = 0 corresponding to the complementary equations 
Xf = 0 whose integrability conditions are I*;, = 0. 

The question naturally arises as to what conditions may serve to assure the 
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existence of an admissible coordinate system y* for which one set of the L’s, 
say L*;(y), vanishes. This is answered by the following 
TureorEM 6.1: The factor tensor conditions 


(6.1) T°, = XL", — X,L*; = 0 
6.2) L*.L', = 


are necessary and sufficient for the existence of an admissible coordinate transforma- 
tion from general coordinates x* to normal coordinates y* for which L*;(y) = 0. 

Proor: Since J*;, and L*,L’s are factor tensors, if there is to exist a coordinate 
system y for which L*;(y) = 0 it is certain that the conditions of the theorem 
must necessarily hold. To show their sufficiency we have in virtue of (6.1) 
that the system Xjy = 0;y + L’;0,y = 0 admits q independent principal solu- 
tions y*(z) satisfying the initial conditions y°(zz , x0) = 2°. Let y'(x) be any 
N — q functions independent of y“ for which | Xjy"| # 0. We assert that 
y’ = y‘(z) constitutes a non-singular transformation to coordinates y for 
which L*;(y) = 0. From the transformation law L*,X,z’ = X,#* it follows 
on identifying #* with y* that L*;(y) = 0. Furthermore the transformation 
z— y is admissible, for | Xgy* | = | dgy* + L'sdry | has the initial value | 5°; — 
L'L'g |ehaat = | 5%s| by (6.2). Finally from | dsy* | = | Xsy*|-| Xiy‘| we see 
that | dny* | ¥ 0 so that the transformation z — y is non-singular. A similar 
proof holds for the complementary 

TueoreM 6.2: The conditions I's, = XgL', — X,L's = 0 and L',L’; = 0 
are necessary and sufficient for the existence of coordinates y* for which L's(y) = 0. 


7. An ArFInE CoNNECTION FoR S” 


By means of the equations (5.1) the factor components I“ s¢ of connection are 
uniquely determined by the components Ls: and conversely these latter com- 
ponents are uniquely determined by a choice of [“»-. In this section we shall 
adopt this converse viewpoint and shall define the factor components [*s¢ in 
the simplest way at our disposal. We first recall from section 5 that four sets 
of the connection components I‘ s¢ are factor tensors. We now restrict these 
to be zero 


(Rr.1) U's, = 0 1* ix = 0 i. = 0 1x = 0. 


From w“sc = 4{(I4ec — Ics). — dX“ 2c] a8 "9 by (5. 5) it follows as a result 
of (R;.:) that the quantities w*;, = 4(I%;y — A*j_) and w'g. = 4(I's, — A‘s) are 
factor tensors. We therefore make the additional restrictions 


(Rr.2) BP yy al i U'se nad V'sk 


which leave us with but two undefined sets of components 1s, and Ux. Thus 
an entire affine connection for S” is determined by (Rz.1), (Rz.2) and definitions 
of Is, and U'j, . 
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The restrictions just imposed on the factor components of affine connection 
simplify the formulas of covariant differentiation as follows: 
Ay.p = Xpry — Apl“sy Neg = Xe — Ad ix 
Ay.G = Xpry — ApA jx Neg = Xpre — Ard’ se 
= X,r* + d’d‘s, AX, = XA* + 4, 
= XA 4+ x1, A“ 9 = Xpd* + V1%s, 


(7.3) 


We define parallel displacement of a contravariant vector A* along a curve 
@ 

a“ = a“(s), where s is an affine parameter, by the equations A* 9 <. = 0. 

For a curve lying in a subspace S‘, holonomic or non-holonomic, the tangent. 


Q 
vector T* = 2 with factor components t* = M“, a satisfies the Pfaffian 
: p 
system t' = M'* AZ — L', a 0. From . = L*¢t° it then follows that 
dx“ dx® 


{* = ie The factor components of A*,¢ ok 0 may be written 


A 
La MM, + Inst” = 0 or + I“ pst"d* = 0 so that for parallel dis- 
placement it: a curve in S* we have 


See ee a 
at aR a tq =0 we L’, 5, = % 


For paraltel displacement of a covariant vector A, in S* the corresponding system 


d 
hie = 


Dr pede: dy* edt dz‘ ; dx’ 
(7.5) ds Aplca GF = 0 de wha ag = 0 ds Lis 
da* 
ds 
undergoes parallel displacement with respect to the curve. From (7.4) the 
equations of a path of S* are 

(7.6) dx dx’ dx’ _ dx L', dx? 


ip es ge ee ae = 


oF y's 
(7.4) ds + U4 


0 has the factor components 
= 0. 


A path of S* is defined as a curve whose tangent vector \* = —, \° = 0, 


= 0. 


8. THe Riemannian S” 
In this section we shall consider only the completely holonomic case for which 
“sy = d* ix = 0 and shall assume that S” is Riemannian with a metric tensor 
Gas. Ricciidaitiny the Christoffel symbols by I s¢ = $G4°(8s@ce + Ac se — 


ae} these latter quantities will be given 


9oGec) and their factor components by 7 
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in terms of Ic and L“, by relations of the type (5.1). We desire the explicit 
form of {4 BC f in terms of the factor components gaz = Gogl* «Leg of the metric 
tensor and for this purpose shall make use of the circumstance that G4s.c = 0 
and hence its frame components gas,c = X c9as — Jos {° C . -g 104° C a} must 
likewise vanish. Substituting into }(gas,c + gac,s — Qsc,a) = O and using 
the relations wz¢ = il({4 E ar s C s}) - Mr =0 expressing the vanishing 
of the factor components of the skew-symmetric portion of ['s,¢ we obtain 
after minor rearrangements 394¢ ({° B a} + 1"o8}) = [BC, A] — #(gno\%ca + 
gcod°na), Where [BC, A] = 3(Xagea + Xcgsa — XsGec). Combining these 


with 3940 ({° wh, ‘e ‘%ca}) = 4919\*sc we arrive at the desired expressions 


(8.1) gsc ‘? aah = [BC, A] — 4(gaad%ca + good? — Gacd* ac): 


We now ask if it is possible to restrict the metric tensor g 4, so that the quanti- 
ties . a8 will satisfy the conditions (R71) and (Ry.2) imposed in section 7 on 
the general affine connection I*5-. It may be verified that the conditions 
(Re.2) Nor =O AW =0 Gia =0 


in conjunction with (R71) and (R72) are sufficient to assure compatibility, for 
on substituting these values into (8.1) we are left with but two sets of non- 


vanishing relations ge, eu = [By, a] and gi, {+n} = [jk, t] defining the compo- 
nents *) and ‘.} which together with (R;.:) and (Rz.2) define the complete 


factor components \ of the Christoffel symbols zc. Under these condi- 


BC 
tions there exist holonomic coordinates z* for which L* »(z) = 6“ and the metric 


0 


tensor G4s(2) = g4a(z) is in the product form | ar Gi,(z") as follows from 
7 








Jost = 0 and g;;,, = 0 when written in normal form. 


9. Contact TRANSFORMATIONS 


In the study of contact transformations a change in notation is desirable. 
Here we are concerned with N = 2q variables z* commonly denoted by x 
and p;. We adapt the general factorization notation of the preceding sections 
to the contact group in 2g variables by choosing the two subranges to be 1, --- ,¢ 
andg+1,.-..,29. Vector components will be written in the form (covariant) 
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| Aa || = |[As, A*||, (contravariant) || A* || = || A’, As||. This convention 
is in need of an additional notation to distinguish between covariant and contra- 
variant quantities with indices in the same position but we shall deal only with 
a few tensors whose transformation character is clearly understood so that no 
ambiguity will arise. A similar scheme will be used for non-tensor quantities; 
for example we shall write L’,,; = LY and L™'; = L;;. 

The full contact transformation group in 2q variables x’ and p; may be defined 
as the totality of transformations X' = X‘(z, p), Pi: = Pi(z, p) for which 
P, dX" — p, dz’ regarded as an expression in the 2q variables x and p; and their 
differentials is the complete differential of some function of the z’s and p’s, that is, 


(9.1) _ P(x, p) dX" (x, p) — p, da’ = dW(z, p). 


More generally, a Pfaffian form go(u) du® in 2q variables u* will be said to be 
preserved modulo a complete differential under the transformation u — @ if the 
relation (yo(@)dst° — ys(u)) du* = dW(u) holds, the necessary and sufficient 
conditions on the coefficients of du” being gor(a)d4a°dsa" = yvap(u), where 
gas = 9492 — Os~4. From the general theory of Pfaffian forms the condition 
| eae | ¥ O is necessary and sufficient for the existence of a transformation u — 2 
sending the Pfaffian gq du® into a canonical form p, dz’. A contact transforma- 
tion x“ — 2“ may therefore be described as the most general transformation 
‘preserving the canonical form modulo a complete differential. For the canonical 
Pfaffian Ig dx° = p, dz’ the matrix of 4g = 041s — dell, is simply 

Ny 0 —6/ 
(9.2) || Taz || sere ; ij saad F 

ny; Gps 0 
and the tensor transformation law I4gdat® = I1gz04%° of its elements contains 
in concise form the familiar equations of definition of the contact group 


(9.3) —d,p; = Op; — 5x’ = o*z? 5'p; inal 0;z’ 


as was first recognized by Eisenhart and Knebelman (1). 

When the function W(z, p) of (9.1) vanishes identically it may be shown that 
P;(x, p) and X'(x, p) are homogeneous in the p’s of degrees one and zero respec- 
tively. For this reason transformations x“ — #4 for which p, dz’ = p, dz’ are 
called homogeneous contact transformations and they form a subgroup of the 
full contact group. By introducing two new variables, z*** and pay: = —1 and 
extending the transformation 2“ — 2“ to 


=t 


x z'(z, Pp) pi = Dpi(z’, Pr) 


a =a + We, pi) Best = Pos = —1 


1 q+ f 
(9.1) becomes p> p, dz’ = py p, dz’. Since any contact transformation in 2q 


variables may be written in this way as a homogeneous contact transformation 
in 2q + 2 variables we shall restrict our discussion henceforth to homogeneous 
transformations in 2q variables. 
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The coefficients || 1. || = || pi, 0 || of the Pfaffian p, dx’ form a vector whose 
transformation law I1gd,@° = Il, gives the relations 


(9.4) P,a:X" = p; P,a*x” = 0 


which are equivalent to (9.3) plus the homogeneity relations p,d"X‘ = 0 and 
p,s'P; = P; and either set fully characterizes the homogeneous contact group. 


for future reference. They are 
Ileal] = || pe, peL"* || 


— Ca; — Lx) — (6%, — LL”) 
— LL") (L* — L*) 


10. NorMAL COORDINATES FOR THE CoNnTACT FRAME 
TueorEM 10.1: The factor tensor conditions 
(Rvo.1) * = XL* —- XL =0 wi = L*-L*=0 wv =pL*=0 


are necessary and sufficient for the existence of a homogeneous contact transforma- 
tion from general coordinates x* to normal coordinates y* for which L"(y) = 0. 

Proor: The necessity of these conditions is obvious from their linearity in 
L'' and their tensor character. To prove their sufficiency we notice from the 
transformation equations L”X*p, = X*z’ of the frame components L* that if 
we identify #“ with the desired normal coordinates y* we must choose Y“(z, p) 
as solutions of the system X*Y = a*Y + L™a,Y = 0. The conditions J** = 0 
assure complete integrability of these equations so that we may find gq func- 
tionally independent solutions Y“(x, p) satisfying 


(10.2) ay’ + L"a,Y' = 0, 


and by the theory of section 1 for equations of this solved type the solutions 
Y' satisfy | ¥* | ~ 0. Contraction of equations (10.2) with p, shows that 
the solutions Y* are homogeneous in the p’s of degree zero. Because | 0; Y'| #0 
we may define g functions Q(z, p) as the unique solutions of Q,0;Y’ = p; which 
will be homogeneous in the p’s of degree one. By the contraction Q,(a°Y" + 
L"a,Y") = 0 we find Q,a*Y’ = 0 and hence y* = Y*(z, p) and q = Q,(z, p) 
satisfy the conditions (9.4) for a homogeneous contact transformation. 

We next ask for the most general contact transformation preserving normal 
coordinates. By identifying x* with y* and #* with g* in L* Xp, = L"X,2° + 
td’ as given by (4. 4), it follows immediately that any transformation yt ° 
normal coordinates y for which L(y) = 0 to a system @ for which also L(g) = 


must satisfy z = 0 and therefore must be an extended point transformation. 
i 


Hence the 

THEOREM 10.2: The most general homogeneous contact transformation y= 
i* preserving normal coordinates for which Li = 0 is an extended point 
transformation. 


(9.5) H as ze ™: ‘| = 
TAB nr; rt 
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We shall need the factor components +, = IgL°, and rss = Uorl*,L*s 
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11. RESTRICTIONS ON THE CONTACT FRAME 


We shall impose henceforth the restrictions (Rio.1) assuring the existence of 
normal coordinates. If we define a union to be any subspace of 8” within 
which an arbitrary displacement dz“ at a point (2“) satisfies p, dz" = 0 it may 
be shown that such a subspace may be regarded as a point locus in the point 
space of coordinates x‘ together with all its tangent hyperplane elements with 
‘coordinates p;. Thus the maximum dimensionality of a union is q — 1. In 
terms of unions the restrictions (Ri.1) require that any q independent solutions 
of the system X‘f = 0 when equated to arbitrary constants shall constitute a 
union of maximum dimensionality gq — 1, for in normal coordinates the system 


becomes = = 0 and its solutions y’ determine the »* unions y* = yj consisting 
of the point (y}) in the point space Y* regarded as the envelope of its 0%? 
hyperplane elements q; . 
By operating on the transformation laws (2.2) with the invariant operator 
—p," it may be shown that the quantities hy», = Li. — p,d"Lj, and h* = L* + 
p,0"L” are covariant factor tensors, and since h* vanishes in normal coordinates, 
L* + p,d°L* = 0. As a second set of restrictions on the contact frame we in- 
troduce 


(Ru.1) —ax = Lr — Liz = 0 hy = Liz — pO Lyx = 0. 


' We next collect for future reference some of the more useful consequences of 
_the restrictions of this section. From equations (2.3) we have L,0'x' = 
— L"5,p, and hence X;#' = X‘p;. The transformation laws of factor vectors 
simplify to \; = A,X ;2" and \’ = \"X,#’ for both the covariant vector || \. || = 
|| A:, A*|| and the contravariant vector || A‘ || = || A‘, Ax || so that an upper 
index of covariance has contravariant character and a lower index of contra- 
variance has covariant character. Under extended point transformations 7 = 
iy’), a = q-0iy", these laws reduce to the familiar form \,(g) = -(y)dw, 
(9) = X'(y)a.g’. The two tensors L' ‘ L’; and L°,L’s of section 4 coincide to 
give just one vanishing tensor LL,; = 0 and the two mixed tensors t%, and ¢'; 
reduce to the single Kroencker delta t‘; = 6’;. The transformation laws (4.4) 
of the L’s are now 


(11.2) Lid = 1,50 +09; L,27 = L%,z° + a’e 
and from (9.5) the factor tensors 4 and 2,4, have the same components as 
—gJ j 

Hyand Was, Il ra |] = | 2:0 [lll eae ll = yg fe The matrix [La | 

M* i nm Ee 
Mi; M, j -|4 he 67 
y* the commutator symbols \4 2c have the values }' ‘aly) = N*(y) = d%(y) = 
Aijx(y) sig Y Li iid Y.L;;, dij (y) = -¢d ALi; - 


M'; 


In normal coordinates 


conjugate to || L“, || is im 
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12, A CoMPLETE DETERMINATION OF AN AFFINE CONNECTION FOR CONTACT 
TRANSFORMATIONS 


We carry over into this section the restrictions (R7.1) and (Ry.2) of the general 
theory of section 7. In our present notation these restrictions are 


(Rw) bess si yi* = 1, = ui te 0, I, ma “, ‘ oe rij", 


and so a complete determination of factor components of affine connection will 
result when we have selected the two sets of components I‘, and 1;*. This 
choice will be made by insisting that the covariant derivative of the factor 
tensor tas be zero, From (7.3) the only conditions are Xjm' — 2,T'n — 
mi = 0 and X igi — aX; — m/l,” = 0, which reduce by the values of 
the 7’s given in section 11 to ; 


(Rw.2) lin +; =0 1* +r"; = 0. 


These restrictions define the components I’; and 1;* so that a complete affine 
connection is given by (Ru. 1) and (Rw.2). 

On expanding Az; and \*’; as defined in section 5 we find Me; = —d ‘La + 
L*Iyj, and XY; = —aL Lied” = —d:L" so that I'n = dla; — L" Muir 
and 1;* = aL”. By contraction with p; we find p,l' ;, = Lj which will be 
interpreted in section (14). The formulas of covariant differentiation coincide 
in pairs to give 

Mei = XA — Adj re? = XA 4+ Wn"), 

M7 = XM— AA MM, = XA + .78,,, 
where the vector \ is either covariant with components (A4) = (Ai, \‘) or 
contravariant with components (A*) = (A‘, Aj). 


13. HoLoNomic CoORDINATES FOR THE CONTACT FRAME 

If we operate on (5.1), XpL*5 = U°psL9 — L* ork nL" s with X ¢ = L° cdg and 
then form (XcXp)L*» = Nea okt we obtain ecole => L* qnsb" gL" chp ; 
er scp is the curvature tensor L* BcD = d-L* DB apL* cat L * aah ne 

L*yeL*cp and the quantities I4scn = Xcl“ns — Xol*cs + Ucal*os — 

Mn¢*cx — d° cpl“on are its factor components. Under the restrictions of sec- 
tions 11 and 12 the only non-vanishing factor components of the contact curva- 
ture tensor are 


Viet = —Uj'er = Xidgu? — Xedgt’ + rea'Ajt — et’ Dga” + Nevin’ — Avesd" 
Pal = Uj", = Xan"; + Xda’ + Ma” — dn’; — NAGe — ee NG 
and they reduce in normal coordinates y* to 
Uiee(y) = 8(0°L,;) — 01(8°L nj) + O'Lerd'Li; — 9°'Ly0"Ln; 
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(13.1) . “f- LoL; pe L"'Ly; 
L' x! (y) _ —F"La , 


We next seek the conditions for the existence of holonomic coordinates 
(2“) = (z', r;) for which simultaneously L*(z) = 0 and Lj(z) = 0. 
THrorEM 13.1: The conditions 


(13.2) l'* aid x*L* y XL vi 0 me wn 7 De L* a 0 « ay pL" mn 0 
(13.3) Tin = Xa -— XiLi=0 —my = Li,-—-Li=0 In’ =0 


are necessary and sufficient for the existence of a homogeneous contact transforma- 
tion from general coordinates x* to holonomic coordinates z* for which L* »(z) = 85. 

Proor: From theorem 10.1 the first set of conditions is necessary and suffi- 
cient for the existence of normal coordinates y* for which L(y) = 0. In this 
coordinate system the left members of the last of (13.3) reduce to (13.1) so that 
if there is to exist a coordinate system z for which L ;(z) = 0 it is at once obvious 
that the conditions (13.3) must necessarily hold. The sufficiency of the condi- 
tions of the theorem may be established by first applying theorem 10.1 which 
asserts the sufficiency of conditions (13.1) for the existence of normal coordinates 
y’. We then apply theorem (10.2) which states that the most general homoge- 
neous contact transformation y — z preserving L‘? = 0 is an extended point 
transformation, 


i dy 
T: p = a = r r| 
g=2y) n= a5, 


f we eae hy ead 
T ry =y) iter 





It remains to be shown that the conditions (13.3) are sufficient for the existence 
of q independent functions z'(y’) such that 7 will constitute a transformation 
to coordinates 2“ for which L,,(z) = 6 as well as L*(z) = 0. To accomplish 


this we note that the transformation law L;-(y) (% + Lau(2) ) " 








dz* ar. 
8q; 8q; dy say’ 8 (a2*\ ay’ 
a + Laz) ar, reduces to L;,(y) Fes % 26 Bye ayi Oe for an extended 
,' ' dy” dy’ a (az spe 
point transformation 7’, and since 3,|.7 0, Lix(y) 7% na at): . 
entiating with respect to q; and solving for ba (= we obtain 3(%) = 
; tals & dy* \ ay? y* \ oy? 
= 0’L, and thus arrive at the linear system 
oy’ 
(13.4) dj2° = Z;‘(y") 
(13.5) OZ; = 2,0 Liz 


in g(q + 1) unknowns z' and Z;' to be determined as functions of the q variables 


y. Since the quantities Z ; may be functions of only y' it is necessary that 
d’"Lix(y) = 0 and these are satisfied by the last of (13.3). The integrability 








TENSOR DECOMPOSITION 713 


conditions of (13.4) are found by expressing the cuniliieun that Z,'(0"L jx — 9°Lx;) 
shall vanish for all Z;, namely 0° (Lik — Lx;) = 0, and these are satisfied by 
the middle set of (13.3). Finally the integrability conditions of (13.5) are 
(0° Lix) — (0 Lia) + &Ln0'Lax, — 3°L,0'L, = 0 and these are satisfied by 
the first of (13.3) as may be seen by differentiating in normal coordinates with 
respect to q:. Finally from the general theory of completely integrable sys- 
tems of the type (13.4) and (13.5) there exist solutions Z;* for which | Z;'| = 
|a;2'| # 0. This completes the proof of the theorem. 


14. InTRopUCTION OF A Metric TENSOR 
The trajectories of a one parameter homogeneous contact group are solutions 
z“(s) of the system 


i 
(4.1) & 


ds 


= aC Ps a wae 


Q 


whose vector character is obvious when written in the form Il4q ad = 0,C. 


C(x, p) is necessarily homogeneous of degree one in the p’s and hence we may 
take C = 1 providing we exclude the singular locus C = 0 from our disen sion. 
Since C is an integral of (14.1) the condition C = 1 will hold at all points of a 
trajectory if it holds at one point. 

We now choose the scalar C to be a solution of Xf = 0, 


(Ru.2) 0,C + L,,d'C = 0. 


dp; _ , dz ‘ 
riders Lyi 7 These equations have 


been discussed in section 7 where it was seen that they express the vanishing 
dx* i 


of \; as defined by die: L*,d° so that these latter reduce to \* = all and 


ds 
dp; dz’ “ . . 
i= a Ler ee 0. Thus the displacement dz“ is restricted to the non- 
holonomic subspace X* of S**, where X° is the point space of points (z') and is 
characterized as a non-holonomic subspace of S** by the arbitrariness of the 


differentials dx‘. Furthermore it may be seen from (7. 5) that the —: of 
parallel displacement of the vector (44) = (p;, 0) are — ops : optus oa 0, but 


from the last of section 12 these become A; = 0 so My the bail under 
consideration have the property that the vector «4 undergoes parallel displace- 
ment along them. 

We shall now introduce a metric tensor G an(x), whose factor components gas 
will be of the restricted form Ilgae || = | “ where gig” = 6,. We 
define g” in terms of the scalar H = }3C”, where C is the scalar satisfying 
Xe = 0, by g(x, p) = (X‘H)" reducing in normal coordinates to g‘(y, q) = 
“Hy, q). From these definitions the components g'’(z, p) are homogeneous 


Then the second set of (14.1) becomes — 
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of degree zero in the p’s. There is a theorem (4) which states that a necessary 
and sufficient condition for the Hessian determinant | d°’H | to be non-singular 
is that | 0°’C | be of maximum rank g — 1. Restricting our discussion to those 
solutions C of X;f = 0 for which the rank of | 8°’C | is g — 1 we have the two 
fundamental requirements, g’’ = g” and |g" | # 0, of a metric tensor fulfilled. 
On doubly contracting the tensor g” with the covariant factor vector 7; = p; 
we obtain the invariant g”p,p, which in normal coordinates is °"*Hq,q, = 2H 
and hence in all coordinates H = 39"p,p.. The system of trajectories (14.1) 


which may equally well be written ad = 0'H and oP: = —0.H is now 
dx' 6 ir dp; ee rs 
(14.3) . ibe J Pr y te 39:9" D:De, 
or in alternative form 
| dp; _ dx" 
(14.4) Pi = Gir a Le ie" 


We next seek additional restrictions on the frame L*, and the metric g” 
which will identify the trajectories (14.3) with the geodesics of the subspace X‘. 
From equations (7.6) defining the paths of the point space X‘ we arrive at the 
equations of the geodesics by substituting the factor components of the Chris- 
toffel symbols in place of the affine connections components. Thus the geodesics 
are given by 


dx’ , fi \ dx’ dx’ — dp; de 
and it is this system that is to be identified with (14.3). 
15. ADDITIONAL RESTRICTIONS ON THE ContTACT FRAME 


We now investigate the possibility of restricting g‘’ to be free of the q’s in 
normal coordinates, d‘g‘/(y*) = 0, as expressed by g’* = 0. This will necessi- 
tate that H(y*) = 49"(y')g,qs be a solution of 0H + L,0"H = 0. The condi- 
tions are 


(15.1) 309° Gre = —LirgQs 

and by repeated differentiation we derive the necessary conditions 
dg” = —(Lag? + dLig") — * Lig" - 

If we introduce the curvature tensor restrictions 

(Ris.2) lx’ = 0 

reducing in normal coordinates to 0°"Lj(y) = 0 we are left with 


(15.3) dg” = —(8' Lig’ + Lig”) 
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which may be solved for the quantities d‘L » as Christoffel symbols formed from 
Giily 2 ei giily* ); ) ‘Li(y) =T ‘aly’ ); and hence 


(15.4) Lisly’, dm) = Ge" jely'). 


Conversely, if the expressions (15.3) resulting from (15.4) by differentiation are 
substituted into (15.1) it may be verified that these latter equations are satisfied. 

The restrictions just imposed on g” and L ;, are sufficient to bring the geodesics 
(14.5) into coincidence with the trajectories (14.3), for differentiation of the 
first of (14.3) in normal coordinates gives 


dy dy’ 4 dq; ay dy 
ds? r ‘aly’) a = 0 ds = Lily ) ds 


Under the present restrictions the equations (8.1) define { at by fhe 


30 "[(X gir + XGir — XGik) — Gied'er + Geer’ jr — Gied * ix) which reduce i In 
normal coordinates to the familiar Christoffel symbols, {*jz(y)} = T'yz(y’). 
Thus the normal form of the equations (14.5) of the geodesics is likewise given by 
(15.5) and because of the vector character of (14.3) and (14.5) this completes the 
proof of their equivalence. It is interesting to note that from the first of (14.4) 
and the relations p,X' j, = 0 resulting from the definitions \‘j, = —L"I,; of sec- 
it dx” dzx* 


tion 5, the first of (14.5) is given by “= + X.gn — XGrs)] — eats = 0. 


The tensor I’, — reduces in normal coordinates to d°L;, — I‘ j(y') = 0 


(15.5) 











and hence in all coordinates I, = oS The paths of X“ are therefore identical 


with the geodesics. We shall regard the affine connection of S”* to be that of 
section 12 where the components I , as there defined now satisfy the additional 
relations [°, = ie 
one set of non-vanishing components Lier = —lj'n reducing in normal coordi- 
nates to the Riemann curvature tensor R’ jx:(y” ) = aI - OMe) + Tel 
l',I",; formed from the Christoffel symbols I“ jx(y’). 
The results of this section may be summarized by the statement that the 


three ostensibly distinct systems of invariantive curves sia of 
Q 


1) the trajectories of I4q = = 04C where X,C = 

2) the geodesics of the point space X* with respect to the metric tensor 

g(a, p) = 3(X‘C*)4, 

3) the paths of xe relative to the connection I‘, = d‘La; — L"Iijr have 
been combined into just one system. 


Because of (Ris.2) the curvature tensor I* scp has but 


16. Dousiy HoMOGENEOUS Contact TRANSFORMATIONS 


Let be a projective space of N — 1 dimensions, S an arbitrary hse 
of 2 parameterized by the equations 2° = ¢‘(u%), i = 1, 2,---,N, a = 
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1, 2,---,N — 2, and P an arbitrary point of S. The conditions on.a set of 
hyperplane coordinates p; that they represent the tangent hyperplane at P 
may be found by insisting that the plane p; cut an arbitrary curve u* = u°({) 
in two coincident points at P whose parameter we choose to be t = 0. This 
means that p,2’(t) = at’ + --- when written as a power series in ¢. Hence the 
dx” du“ 
du* dt 
hold for an arbitrary curve u“(t) the conditions on the point-hyperplane element 
(x‘(u), pi(u)) that it represent a point z‘(u) incident with the plane p;(u) which 
in turn is tangent to the hypersurface S at 2‘ are p,2" = 0 and p, be = 0. But 
dually we may regard S as a locus of hyperplanes p;(w) and ask for the condi- 
tions on a point 2‘ that it be a “tangent point” of S. By analogous reasoning 


conditions for t = 0 are p,x” = 0 and p, — = 0. But since these must 


these conditions are p,z” = 0 and 2’ ee = 0. An N — 2 spread of point- 
hyperplane elements (x‘(u), p:(u)) will be said to constitute a union after the 
terminology of Lie if the conditions p,x” = 0, p, 7 0, 2” pa = 0 are satisfied 
as identities in the arbitrary parameters u*. 

Suppose now that the union of elements z“(u) viewed in = either as a point 
locus x* = x'(u) or as a hyperplane locus p;(u) is subjected to a transformation 
&* = #*(x"). If the new elements as functions of the same parameters wu“ are 
to form a union with the point locus #' = #'(x*(u)) and hyperplane locus 
Di = pi(x"(u)) asa —r of te: the transform of a union we must have 


ze" Dp 


due = p(z, D)Pr oo qe aa 


parameterization u“. Expanding the last two conditions we Ain pAk = pi, 
pdt = 0, #'0'p, = »zx', #0, = 0, and differentiation of the first with respect 
to x’ and p; gives (A — u)pi + dAp,xz" = O and (A — »)z* + d'Ap,z” = 0. But 
these latter conditions must hold identically both when p,2” = 0 and when 
px ~ 0sothat a4. = Oandd\ = » = »y = constant. There will be two distinct 
types of such transformations depending on whether \ > 0 or A < 0. In all 
generality we may take \ = 1 in the first case and \ = —1 in the second giving 
the two types 


Dr, = A(X, p) Pre’, pet = (2, Pa" see for an arbitrary 


5. O;k" = 9; # 0'p, = oa 
(I,) (Proper contact transformation) ' . Pt = px |. 
paz = 0 Z dip = 0 
(II) (Improper contact ae mi fep . SA tee r ¥3'p, = — 
transformation) p,0'#” = 0 ye ES Og Zz’ 0:p, = 0. 


Let us next consider an element transformation sending the point locus 
a=2 ‘(u) into the hyperplane locus p; = p,(x"(u)), and dually, the same locus 
when regarded as a hyperplane locus p; = p:(u) into the same barred locus 
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regarded as a point locus #* = #‘(x"(u)). The conditions that the transformed 
elements 74(u) constitute a union if the original elements z“(u) form a union 

=r r = OF 12 OP, r Op, _ , dx” 
are pri’ = 0'p 2", Br = BE oe a Pre? 
\' =’ = v’ = constant so that the transformations under discussion fall into 
two classifications: 


and again it follows that 


é pO; = 0 | # dD, = 0 
(I,) (Proper contact correlation) yi bt = px alo ih 
POL =z Z O:pr = Pi 
y ar rat He 
I (Improper contact p,d:% = 0 ia ini z'0'p, = 0 
(I ») correlation) poz ian pay r r 7 asd, an i 


By differentiating either the equations I, or II, one may verify the bracket 
relationships 


(for I, and ITy) {x', 2} = 09,0;8 — 09,02" = 0, 
{p:, pi} = 0°p,0°2" — 0%p,0'2" = 
{p:, 2°} = 0°02" — 0,p,0'%" = 3';. 


Similarly the transformations II, or I, yield 
(for II, and Ip) {z*, x’} ‘gt 0, {pi, pi} = 0, {pi ’ x’} a —8'; ° 


Using a device originated by L. P. Eisenhart and M. S. Knebelman (1) we 
employ the bracket relations to identify the expressions 


dx’ = 6; dz + Sx’ dp; “dpi = 5,p, dk’ + 5’p; dp; 
dx’ = d°p; dz’ — ae dp; — dp; = —dp, dz’ + 0,4’ dp; 


with the subsequent results 


(for I, and IT,) 6° = d'p;, F2' = —d's’, Ii = —OH;. 

In like manner we find 

(for II, and I,) 5° = —0'p;, Fx = a'%’, = pi, = 9D; 

These serve to establish the homogeneity properties 

foe 1 and 1h)? SET OP) agit ant yy 2 7 Mi ® 
pilex, kp) = kp,(a; p) pcx, kp) = cp (x, p) 


so that the transformations may be written 
(Land II,) z#° = x',é', pi = p,O'p; (Ip and Il) f= pd", pi = v'Opi. 


It may be verified that the four elements I, , II, , I, , Il, form a group with the 
identity I, which is isomorphic with Klein’s four-group. 
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17. A Compete DETERMINATION OF A Factor FRAME FROM AN AFFINE 
CoNNECTION IN A Fiat ELEMENT SPACE 


We consider in this section a flat element space in which there exist normal 
coordinates y* for which the affine connection components L*s¢(y) = 0. Since 
0 »¢ = 0 the connection is symmetric and will be written asT“,c. Confining 
ourselves to the doubly homogeneous contact group, we purpose to define a 
factor frame L*, in terms of I'y¢ and the element vector (x*). To this end 
we first observe that the invariant numerical tensor IIc, has a vanishing co- 
variant derivative in normal coordinates, and hence in all coordinates 
Host ac + Teel 24 = 0, or 

k 


Tig = Teg ri; = —I";; r’;* = r”;' 


(17.1) ri sal rt ri? on ae mt, = T;; ° 


. . . . A . . 
Since the covariant derivative of the vector 2“ satisfies 2*,, — 6“, = 0 in normal 
coordinates we have the zero tensor x°I'“ 9 whose expansion gives 


(17.2) pl" ji a aT; pe ve rT,*, pT” ;' pe aT,;, pr; ‘aii rT}; 


when reference is made to (17.1). We define the factor frame L;; and L" by 
Li; = Ti; = pT’; and L* =" = pI" so that T;; = Ty andr’ = I and 
shall demonstrate that the quantities I';; and I’ satisfy the conditions for a 
factor frame. First we note that the normal coordinates y* for which 
I“ zc(y) = 0 are likewise normal for L“, since L“s(y) = 645. Recalling the 
results of section 3, the transformation group now under discussion is the inter- 
section group 2 ~ =* where = and >* are characterized as all transformations 
from normal coordinates y* to general coordinates x* for which | Y,2’| = 


j . . . 
o ~ Oand | Y’p;| = Ks ~ 0. But since nll = ag > and >* coincide. 
dy 0g) ee 
Furthermore, from 5°; = Xjy'Y,«’ = Y"p;Y,x' = 0'p,0,2° = 0'q,0;y' and from 
the bracket relationship 0‘g,0 jy" — 0,9,0'y’ = 6°; it follows that the transforma- 


tions y — x must satisfy 


I 











(17.3) 3,q,0'y" = 0. 
If in the transformation law 
(17.4) TM 5c = Fp 0040 pk"0 ck® + Ford" 5 ck? 


we identify z* with normal coordinates y*, we have I“ s¢(z) = dot*d’ acy”, and 
on contracting with 2, zT,;‘ = 2'(d,p,0°;'y’ + 8°p,d°'q.) = y'd°;‘q.. But on 


differentiating y’d.p; = 0, y’d"(d.p;) + d*y'a.p; = —(y'd";*qe + O'y'diq) = 9, 
and therefore 
(17.5) zT,;}' = 2T,*; = 0. 


To obtain the transformation laws of the frame components sum (17.4) in the 
form T° spd 9rd ct" = T*gcdex° + 5 xqt*dck* with the covariant vector (t4) = 
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;, v), tol @ ck” = tel ecdst” + 2eAc(dsr°), and expand with paleo 
tof (17. 2) and (17.5). The resulting equations fall into the three sets L;,d'x" 
_L"5;p,, Lirdht” = Lada” + Oxp;, L”5,2" = La,’ + o'z’ agreeing with : 
the second line of (2.3) and with the form (11.2) of the first line of (2.2). This oe 
completes the identification of the quantities T;; and I’ as components of a 


factor frame. 


Pee ne 


Mw — 


18. A Reat Decomposition UNDER THE CoMPLEX GROUP 


5 ___ &. wy 


Let (c*) = (x*, a *) be 2q real variables i in terms of which g complex variables 
z are defined by 2° = a" + iz". If 2* be subjected to an analytic transformation 

= i(z’) the 2q real variables (¢*) = (2*, 2*) defined by # = # + :#* will 
slit the conditions 


(18.1) 0,@* = 038" =a" = —09, 2". 


These generalized Cauchy-Riemann equations fully characterize the complex 
group é“ = £“(x) induced on the 2g real variables x* by the group of all analytic 
transformations on the g complex variables z. The conditions (18.1) may be 
written more concisely as the transformation law E°,d9%* = E*90st%° of a 


E’; E'j|| _ | 0 —-3s; 
E’; E'; a; 0 
all coordinate systems. 

There is a marked similarity between the complex and contact transforma- 
tion groups which will become increasingly evident as we proceed. Just as 
L* = 0 served to characterize extended point transformations as a subgroup 
of the contact group, so here from the laws L*,(a;2" + L' 0:4") = 0,4" hi, L a;z" 
and L';(agz” + L’,0,2°) = 0,%' + L';0,2° it may be seen on identifying x* ak; a 
system of coordinates y* for which 


(Ris) ‘“(y) =0 L*y) =0 
and * with a new set of coordinates 7* with the same properties that these 
completely characterize the subgroup < A the complex proup consisting of all 
separated transformations of the form g* = ¢*(y’) and g* = ¢'(y*). We accord- 
ingly define the L’s by (Ris.2) in some arbitrary but definite coordinate system 
y* which will be called normal. As immediate consequences of these definitions 
we have L';L*, = 0. 
LL'j=0, Ly+Ly=0, Xa = X32", 
od 0. a8", 
bj a; Oo 
We complete the choice of affine connection made in section 7 by stipulating 
that the covariant derivatives of the tensor ¢“y shall vanish, the conditions being 


i : : . : : : H epee | 
fi =Xer—E' ates =O Cx jp = Xion — Nie t+ Cel jp = 0. 


simple mixed tensor EZ“, with the components 


























ll “2 || res || M+ E° 2 L*, || " 
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We define [*, and [' je as the solutions 
(Ris.s) Vn = vie lie = Vp 


of these equations. Because all components of the curvature tensor vanish for 
our present choice of connection the spaces S” is flat. From section 3 the sub- 
groups 2 and >* of the full complex group corresponding to our choice of frame 
consist of all transformations from normal coordinates y* to general coordinates 
x“ satisfying 


ax’ 
oy? 


da’ 


oy’ 


ax! 
oy? 





~0O and x~ 0 

















+. Ox! as ie eee 
+ Li) 55 ays + LID ay! 





so that > = >*. 


19. A CompLex DrcomposiITION UNDER THE COMPLEX GROUP 


The transformation equations of the factor frame are satisfied by the quanti- 
L'; L'; a‘; —i6'; 
LD Ly; is; 8 

systems, for they become —id,z° — 0;z' = 0,2@° — 10;2° and —ia;z° — 0,2° = 
8;z° — ia,e° which hold in virtue of the equations of definition of the complex 


in all coordinate 


ties L“, with the components 






































Bis 8g 
i yi, 59% 593 
group. The reciprocal matrix || M“ || is given by le ? mh, = ’ ’ 
M*; M’; ae 
0 3° 
The equations 
én bys F 
z= 2 + iz’ daca ahs 
7 ars 


i - ¢ - 
2=2' — iz 
ai = 5( — 2/) 


represent a transformation from the 2q real variables a“ to the q complex 
variables z’ and their conjugates z’. The vector components Z and factor com- 


ponents d of a real covariant vector A, and of a real contravariant vector A“ 
in the new complex coordinates z* will be given by 


1 : 1 ; } 
Zu =5(h— ik) = 3m,  Ze= 5 (As + is) = 505 


Z = v* + ia* = ay", Z* = A® — fA* = —2y". 


The following relations will be needed in our discussion and may be verified 
without difficulty: 
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a i _ 02° si spade 
X; = 2 X;z = 3a? Xx = 2 


: “SS : ~> , 
X,z = oz (3) ~ ( a2 ) _ #2 
ee BaF? az) ~ aa? az dz*) ~ az) ack’ 
re a ~ cole the complex conjugate. Under the transformation 
= #(2', 7 = Fe '), the z components Z, and Z* of A, and A“ transform by 


(19.1) 





az" az" ‘ az 
2, = Bee. AaB Fa TE, Bae 
and by merely replacing Z4 with \, and Z* with \* these same equations give 
the transformation laws of the factor components. 

We now come to the selection of an affine connection. Let us first consider 
a general affine connection L*sc(z”). Because of the separated form of the 
transformations z* — 2“ all components of affine connection will transform as 
complex tensors with the exception of the sets L‘y, and ig je which will obey 

oe! ; Of dz’, a aé Lie az! ; dz de’ ate’ 


ik 5g — Lv ogi ope T Sar ant? “oe = 553 agt | agi azt’ 





From (19.1) it follows that (L‘,) — L‘j¢ is a tensor and therefore we make the 
invariant restrictions 


(Ri.2) L' je = (Ln). 


In selecting factor components of affine connection I‘ sc we retain the restric- 
tions of section 7 leaving us with but two sets of components I’, and Lig trans- 
forming by 


(53) VaXei =U,R aR + RR Xa’ = Re Xe’ + Fike 


yet to be defined. From (19.1) it follows that I’, — 2L‘» and U'jp + 2iL'j 
are tensors and therefore we define the only non-vanishing factor components 
of connection by 


(Rips) Vn = QL, Vig = —UL*y. 


The formulas (7.3) for the covariant derivatives of factor vectors are now simply 
multiples of the covariant derivatives of the complex vectors Z;, Z* and their 
complex conjugates. The restriction (Riy.2) identifies the complex conjugate of 
the covariant derivative of a vector Z with the covariant derivative of the 
complex conjugate vector Z. 

The complex decomposition of this section is valid for the full complex group, 
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for the criteria for admissible transformations « — @ are | X,z°| = | J ~0 
zi 





; az" 
xX op S| 0. 
and | X;z° | | a # 
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REPRESENTATION OF ERGODIC FLOWS 


By WaRREN AMBROSE 
(Received December 28, 1940) 


Introduction 


The theory of measure preserving transformations and flows (a flow is a 
|-parameter group of measure preserving transformations) originated in the 
study of classical dynamical systems. The fact that von Neumann’s proof of 
the mean ergodic theorem and G. D. Birkhoff’s proof of the ergodic theorem 
make no use of the strong regularity conditions fulfilled by classical dynamical 
systems together with the fact that these theorems have applications in other fields 
where such strong regularity conditions are not fulfilled (notably in probability 
theory) has however led to a study of measure preserving transformations in terms 
of purely measure-theoretic formulations.’ The present paper is concerned with 
the theory of flows from this measure-theoretic standpoint. 

The principle result of this paper (Theorem 2) is a theorem which asserts that 
every (measurable) ergodic flow (an ergodic flow is one in which every set of 
positive measure sweeps out the whole space) is isomorphic (with respect to all 
measure properties) to a certain kind of flow that we call a flow “built under a 
function.”” This representation theorem has various consequences of which 
the most important probably is the following: In studying the flows of classical 
dynamics one is constantly picking out cross sections, drawing tubes about the 
trajectories and, generally speaking, considering separately what happens along 
and what happens perpendicular to the direction of flow. This isomorphism 
theorem makes possible a similar analysis,—for an ergodic flow,—’ under the most 
general measure theoretic formulations. It makes possible such an analysis 
because for a flow built under a function there exist certain sets with the proper- 
ties of a cross section and the measure on the space is the direct product measure 
of a certain measure on the cross section with Lebesgue measure along the 
trajectories; these are the essential features necessary for such an analysis. As 
in the classical cases various properties of the flow become intimately tied up 
with the properties of a related transformation on this cross section. We re- 





‘See for example III, IV and X in the bibliography at the end of this paper. 

* Flows of this kind were first introduced in a special case, by von Neumann in X, p. 636. 

* Recently Prof. 8. Kakutani has succeeded in extending this theorem to non-ergodic 
flows, using Theorem 1 of this paper. His method is similar to that which we use for the 
ergodic case. This and some other similar theorems will appear in a joint paper by Prof. 
Kakutani and the present author under the title, “Structure and Continuity of Measur- 
able Flows.” I take this opportunity to express my indebtedness to Prof. Kakutani for 
very many very helpful discussions of all the matters with which this paper is concerned. 
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mark that this representation theorem also enables us to show (except for one 
trivial and highly uninteresting kind of situation) that the trajectory of any 
point is always a measurable set (this we prove even for non-ergodic flows). 

In section 1 we give our definitions and in section 2 we prove our general 
representation theorem. In section 3 we discuss the group of unitary operators 
always (since Koopman, V) associated with a flow and establish a spectral condi- 
tion that a flow have a special kind of representation as a flow built under a 
function. We conclude by extending a theorem of von Neumann about differ- 
entiable ergodic flows to measurable ergodic flows; this extension is possible just 
because our isomorphism theorem enables us to obtain in the general case the 
tubes and cross sections for whose existence von Neumann assumed differenti- 
ability. 


1. Definitions 


DeFINITION 1. A measure space is a space, 2, on which a completed‘ (count- 
ably additive) measure, m(M), is defined, for which 0 < m(Q) < © and such 
that there exists at least one measurable set, M, for which 0 < m(M) < m(Q). 

DEFINITION 2. A measure preserving transformation, T, is a 1:1 point trans- 
formation of a measure space 2; onto’ a measure space Q% with the property that 
M, = TM, is measurable (in Q) if and only if M, is measurable (in Q) and for 
which m;(M,) = m2(M:-) for every measurable set M,.° 

DEFINITION 3. A flow is a 1-parameter family, T;, (-—2© < t < @) of 
measure preserving transformations of a measure space onto itself, which has 
the group property: 7:7, = Ti+. for all t and s. 

Dertnition 4. Let 7; be a flow onQ. The set M € Q is invariant under T; 
if whenever P is in M then T7,P is in M for all ¢. 

DeriniTion 5. The flow 7’; is ergodic if there are no measurable sets invariant 
under 7’; except possibly sets of measure 0 and complements of sets of measure 0. 

DEFINITION 6. Let 7; be a flow on ©’ and let S; be a flow on 2”. T, and 8; 
are isomorphic if it is possible to split 2’ into two disjoint measurable sets whose 
sum is 2’, 2; and 2, and to split 2” into two disjoint measurable sets whose 
sum is 0’, 9 and 9, in such a way that: 

1) Each of Q; and Q; is invariant under 7; and each of 9} and 9 is invariant 
under S,. 

2) Both and 9 have measure 0.° 





‘ The measure m(M) is completed if every subset of each set of measure 0 is measurable. 

5 We use the term ‘“‘onto” with the customary meaning, that every point of 922 is the 
image of some point in Q, . 

° Here we use m, for measure on Q; and m; for measure on 2. Obviously 7’ is a measure 
preserving transformation of 2; onto Q, if and only if T-! is a measure preserving trans- 
formation of 2: onto 2,. Usually Q; and Q, will be the same. 

‘If T and S are transformations then by “‘T = S’” we mean that TP = SP for every 
point P. 
® We allow the possibility that one or both of 9; and 0%’ is empty. 
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3) There exists a measure preserving transformation, R, of 2; onto 2;/ such 
that 7,P = R"S.RP for all P in Q, (and all ¢). 

“broughout this paper L will denote the real line taken with Lebesgue 
measure. If 2 is a measure space then 2 X L will denote the product space of Q 
with L, taken with the completed direct product measure defined in terms of 
Lebesgue measure on L and the given measure on @. 

Dermnition 7. Let 7; be a flow on Q. T' is measurable if the function 7'.P 
is a measurable (P, ¢)-function, i.e. if for every measurable set, M, in Q the 
(P, t)-set for which 7,P is in M is measurable in @ X L." 

Equivalent to this definition is the assertion that for every complex-valued 
measurable function f(P) defined on Q the function f(7,P) is a measurable 
(P, )-function. 

DeriniTIon 8. Let 2 be a measure space and 7’ a measure preserving trans- 
formation of 2 onto itself. Consider the product space 2 X L of Q and the real 
line L (where measure on this product space is defined multiplicatively in terms 
of Lebesgue measure on L and the given measure on 2). Let f(P) be a real 
valued integrable function defined on 2 with f(P) > c > 0 for all P in Q, and 
let & be the portion of 2 X L under the graph of f(P), i.e. let & be the set of 
points P = (P, x) for which O S x < f(P). Then@is a measure space. Define 
the flow 7, on & by 


(T(P, x) = (T°P,t +2 —f(P) — --- — f(T” 'P)) 
forn > Oandf(P) +--. + f(T" 'P) — x 
St<f(P)+---+f(T"P) -z, 
(1.1) {7(P, x) = (P,t + 2) for —z St < f(P) — 2, 
T(P, 2) = (T"P,t+2+f(T'P) +---+f(T"P)) 
forn > Oand —f(T"P) — ... — f(T-"P) — x 
<t< —-f(T"P) —..--—f(T""P) — «. 





\ 


We call 7, the flow built on the measure preserving transformation T under the 
function f(P).° 
In this definition we have demanded that f(P) be integrable only to make 2 © 

have finite measure; if we were considering flows on spaces of infinite measure 
then we would only have demanded that f(P) be measurable and > 0. We 
might have required that f(P) be positive everywhere rather than uniformly 
Positive; this distinction is unessential but with f(P) uniformly positive some 
of our proofs are simpler. 





: In case © is a topological space and the measure is “properly” related to the topology 
It 18 usual to define a measurable flow as one with the property that for every open set 0 
in @ the (P, t)-set for which 7'.P ¢0 is measurable in @ X L. This apparently less restric- 
tive definition is however equivalent to ours. For a proof see IV, pp. 9, 10. » 

a We shall not prove that the 7, thus defined actually are measure preserving trans- 
formations, A proof that they carry measurable sets into measurable sets is contained 
in the proof of Theorem 1. An application of Fubini’s theorem then shows that they are 
measure preserving. 
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In considering flows built under a function it will be essential to consider the 
functions F(P) and G(P) defined by 
(1.2) F(P) = F(P, x) = f(P),” 
(1.3) G(P) = G(P, x) = z. 
Intuitively, G(P) is the length of time since a particle now at P was last in Q, 
while F(P) is the length of time since it was last in Q plus the length of time 
before it will again be in Q. 


The sets defined by G(P) = constant give us cross sections for such a flow. 
If M is any set in Q we call the P-set defined by 


[ PeM] 


(P,2) 


a tube and we say that M* is based on M. If M* is a tube with F(P) => d> 0 
for every P in M* and if 0 S a < b S d we call the set M*(a, b)” defined by 


M*(a, b) = M*. las G(P) < 5] 
a box, and the set M*(b) defined by 
M*(b) = M*[G(P) = b] 


a leftover of the tube. We say that M* is the tube through the box M*(a, b).” 
Let M* be a tube and let M; and M,,; be defined, for each positive integer n, by 


= [ka < F(P) < (k+1)2"|-M*, k=0,1,2,-:- 
ad = [72° < GP) <(j+1)2"]-M, 7 =0,1,---,k-1. 
It is readily verified that 
(1.5) mS m= oe (n— @). 


This fact will frequently be useful in what follows because it shows that every 
tube is a limit of sums of boxes. 


2. The first representation theorem 


In our definition of a flow built under a function we specified that the measure 
on 2 be the direct product measure of a measure on Q with Lebesgue measure 
on the z-axis. Suppose, however, that we have a flow defined by (1.1) on 4 
region © without assuming that the measure on @ is such a direct product 
measure, and suppose that we do not even have a measure defined on © (so that, 





1 We shall use the notations P and (P, x) interchangeably for points in 2. 

12 We shall adhere to this notation, M* for a tube, and M*(a, b) for a box through Me, 
throughout this paper. 
13 Obviously a box uniquely defines the tube through itself. 





th: 


Sol 


nu 
M 
sh 


cor 
is { 
on 
if a 


me! 
of 1 


Iti 
ig i 





REPRESENTATION OF ERGODIC FLOWS 727 


in particular, we are not assuming that f(P) is a measurable function or that 7 
ig a measure preserving transformation). The following theorem shows that if 
the flow satisfies certain measurability conditions then it is possible to put a 
measure on @ for which T will be a measure preserving transformation and f(P) 
will be a measurable function and such that the given measure on © is the direct 
product of this measure on Q with Lebesgue measure on the z-axis. 

TuzorEM 1. Let 7; be a flow defined by (1.1) on a region Q, with a measure 
n(M) on®. If Tis a measurable flow and if the functions F(P) and G(P) are both 
m-measurable, then there exists a measure m(M) on Q for which f(P) is a measurable 
function and T is a measure preserving transformation and such that m(M) ts the 
completed direct product measure of m(M) on Q with Lebesgue measure on the x-axis. 

Proor: We first define a Borel field” IN of sets in Q and put a measure, m(M), 
onit. We define 2% to be the collection of those sets, M, in Q with the property 
that M*, the tube based on M, is m-measurable. Obviously, Jt is a Borel 
field. It is trivial from the fact that F(P) is m-measurable that f(P) is measur- 
able with respect to It. We define m(M), for M eM by 


m(M) = (1/c)m(M*(0, c)).” 


We want now to show that m(M) is a completed measure and that, with this 
measure on 2, 7’ is a measure preserving transformation. 

Because F(P) and G(P) are m-measurable we know that if a tube is m-measur- 
able then so are its boxes and leftovers. Now we show that if a box is 
m-measurable so is the tube through it. If a box is #-measurable then any 
sub-box which has the same tube through it is m-measurable because any such 
sub-box is obviously the intersection of the original box with a transform (by 
some 7';) of the original box. Also, if a box M*(a, b) is #i-measurable then the 
box M*(0, b) is m-measurable since the latter is obviously the sum of a finite 
number of transforms of the former. To show that the tube through the box 
M*(a, b) is m-measurable we see from (1.4) and (1.5) that it is sufficient to 
show that (for large n) each M;,,; is m-measurable; the m-measurability of the 
M,,; (for large n) follows from the fact that 


Mi; = Tj-»(M*(0, 2) [ka < F(P) < (k + 12"). 





The region & is not a product space unless f(P) = constant so it cannot be precisely 
correct to say that m(M) is a direct product measure. Precisely expressed what we mean 
is that if we define m’(M’) on @ X L as the completed direct product measure of m(M) 
on and Lebesgue measure on L then the following is true: if M C@ then M is M-measurable 
if and only if.it is m’-measurable and if it is M-measurable then m(M) = m’(M). 

By a Borel field we mean a collection of sets closed under the operations of comple- 
mentation and countable addition. 

* The notation M*(a, b) was introduced above. The number c here and in the remainder 
of this proof is some fixed positive number with the property that f(P) > c for all P in Q. 
It 1s easily seen (and in fact is a direct consequence of a later part of this proof) that m(M) 
is independent of the particular choice of ¢. 
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It follows readily from what we have just shown that m(M) is a completed 
measure. We shall now show that 7’ preserves m-measure. Let M «It and 
N = TM; we shall show that the #-measurability of M*(0, c) implies the 
m-measurability of N*(0, c) and that these two sets have the same 7-measure. 
This will prove that if M «Mt then TM eM and m(M) = m(TM). A similar 
proof shows that 7” also has these properties so this will complete the proof 
that 7’ is m-measure preserving. We define, for each positive integer n, the sets 


mM = M*. [re < F(P) < (k+1)2”"] [ro —c SGP) < k2”]. 
It is readily verified that 


T. >, M* > N*(0, c) 
imo 
3 (n— «), 
D T.-1M* + M*(0, c)” 
=o 


Because M* is m-measurable it follows from this that N*(0, c) and hence N* is 
m-measurable. Because in each of these sums the summands are disjunct it 
follows that m(M*(0, c)) = m(N*(0, c)) and hence that 7’ is m-measure pre- 
serving. 

Now we consider the space 2 X L and in this space the Borel field $y deter- 
mined by all sets of the form M X (a, b) where M eM.” Defining measure 
multiplicatively in Q x L’* we have a measure, m’(Bx), defined for all By ¢ Bs. 
Completing $s with respect to this measure we obtain a Borel field Mx. Now 
we note that & ¢ B ; this follows from (1.5) applied to the tube M* = 0 We 
define %’ to be the Borel field of sets of the form Bs@ where By ¢ Bx , and M’ 
to be the Borel field of sets of the form M,® where Mx e Mx.” Let M denote 
the Borel field of 7i-measurable sets. Then what we want to show is that 
Mm’ = M and that for sets M eM we have m’(M) = m(M). 

We now define §’ to be the field” consisting of those sets which are finite 
sums of boxes and leftovers of 7-measurable tubes. Now the Borel field deter- 
mined by §’ is B’.” To see this we note first that 8’ is the Borel field deter- 





7 The sets M; and M,,; are defined in terms of a given tube which we here take to be 
the tube through M*(a, b). 

18 By the interval (a, b) we shall mean the interval taken with its left end point a but 
without its right end point b. The Borel field determined by a collection of sets is the 
smallest Borel field containing that collection. 

* For a discussion of product measures in abstract spaces see VI. 

20 An equivalent definition of It’ would have been to say that it is the Borel field ob- 
tained by completing 8’ with respect to m’-measure. 

21 By a field we mean a collection of sets closed under the operations of complementation 
and finite addition. 

*2 For a discussion of fields, Borel fields, etc. see VI. 
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mined by all sets of the form (M X (a; b))- where M eM.” Now each such 
set is easily seen to be the sum of a box of some m-measurable tube and a left- 
over of some m-measurable tube; on the other hand, every m-measurable box, 
every m-measurable tube and every leftover of an m-measurable tube is in B’, 
so ¥’ Cc B’. This proves that B’ is the Borel field determined by §’. 

Both F(P) and G(P) are measurable with respect to B’; this is true for F(P) 
because the P-set for which F(P) > 6 is, for any b, an m-measurable tube and 
such a tube we already know to be in 8’. This is true for G(P) because 


|e®) < bj = Fe) < b] + [F®) > b] Je®) < bj. 


The first set on the right side of this equality is an #-measurable tube and the 
second is an m-measurable box, so G(P) is measurable with respect to B’. 

We shall now prove that Dt’ c WM and that if M «M’ then m’(M) = m(M). 
We know that every m-measurable box belongs to-IN, and that every leftover 
of an m-measurable tube belongs to I; it follows that §’ and then B’ is included 
in JX. We shall show that if M ¢ 8’ then m’(M) = m(M). This trivially im- 
plies that I’ < Mt and that m’-measure and m-measure agree on PM’. Con- 
sider an m-measurable box, M*(a, b); let n be a fixed positive integer and write 


M*(a, b) = M*(a, a + [b — a]/n) + M*(a + [b — a]/n, a + 2[b — a]/n) 
2p 


+ M*(b — [b — aJ/n, b). 
These sets are all disjunct and are transforms of one another (under members 
of the group 7',); so each has #-measure equal to (1/n)m(M*(a, b)). It follows 
readily that for y real and a < y < b we have 


(2.1) m(M*(a, y)) = [(y — a)/(6 — a)}m(M*(a, b)). 
From the definition of m’-measure we have 
(2.2) m'(M*(0, c)) = c(1/c)m(M*(0, c)) = m(M*(0, c)). 


Taken together (2.1) and (2.2) imply that m'(M )= m(M ) whenever M is an 
m-measurable box. By (1.5) it follows that this is true whenever M is an 
m-measurable tube and hence for all M in §’. Hence m’(M) = m(M) for all 
Me%'™ Hence 2’ C I and for sets in M2’ the measures m’ and mi agree. 





* This is a consequence of the following trivial lemma: If 2, D @:2, {Fa} is a collection 
of sets in 2, , B, is the Borel field determined by the F. , and @; eB; then the Borel field 
of sets of the form M92 where M, €®; is the same as the Borel field determined by sets 
of the form FD: . 

* The fact that if two (countably additive) measures agree on a field then they agree 
on the Borel field it determines is best proved by use of the theorem (VI, p. 85) that the 
normal family determined by a field is the same as the Borel field determined by the field. 
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Now we shall prove that each 7; takes sets of B’ into sets of B’ and sets 
of I’ into sets of Mt’. Because we know that M’ C M and that the 7, are 
m-measure preserving it is sufficient to prove the former, and because the 7, 
form a group it is sufficient to prove this for0 S ¢< cc. Because the 7i-measur- 


able boxes are a determining collection for ¥’ it is sufficient to show that any - 


m-measurable box goes into a set of 8’. Let then M*(a, b) be an mi-measurable 
box and ¢ a fixed real number, with 0 S ¢ < c. Let M* be the tube through 
M*(a, b), M the base of M*, N = TM and N* the tube based-on N. Then it 


is easily seen that 


T,M*(a, b) = M*.[a+t< G(P)<b+ 4] 


+ lim > Ns. [ko <b+t—F(T_1P) < (k+ 1)27| 
P 


neo k=0 


*[k2™ +a—b S GP) < ko]. 
a 
Now M* and N* are m-measurable tubes and hence in %’; the set for which 
a+t< G(P) < b+ tis in 8’ because G(P) is measurable with respect to 8’, 
and the sets under the summation are 7i-measurable boxes and hence are in 9’. 
Consequently 7',.M*(a, b) « B’.” 

It remains to show that I? C M’. Define the set H by 


H = [G(P) < cl. 


To show that Ii C M’ it will be sufficient to show that if MeM and Mc A 
then M eM’. This is sufficient because any set in It is obviously the sum of a 
countable number of transforms of sets in 9 which are included in A and we 
know that each 7’, takes sets of I’ into sets of M’. 

Now we consider the product space 2 X L (L is the real line and £ will denote 
the Lebesgue measurable sets) and in this product space four Borel fields: 
Mm’ xX &M @2M x &M@L These are defined as follows: M’ X Lis the 
Borel field determined by sets of the form M xX E, where M eM’ and EF e&; 
M’ @ Lis the completion of this Borel field with respect to the multiplicative 
measure defined on it. It X Land M @ L are defined similarly in terms of M 
and %. From the fact that Dt’ Cc M it follows that M’ @ & Cc M @ & and that 
if M « M’ @ & then the measure defined for it as a set of Mt’ @ L is the same as 
the measure defined for it as a set of It @ &. We shall denote this measure in 
& x L by m(M). * 

In 2 X L we are considering points (P, x, t). If M is a (P, z)-set and M: 





** What we are proving here is that a flow defined by (1.1), with measure taken as @ 
product measure, takes measurable sets into measurable sets. 
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is a (P, t)-set and if M contains the same pairs (P, y) as M, we shall write M = 
M,. If Mc @ X L we shall use the following notations: 


Mes) = [P, a, t) e M] 


M' = bP a, t)e M]) 


~ 


M, = oh x, t)e M). 


Now we consider the transformation S of 2 < L onto itself defined by S(P, z, t) 
= (P,x,t+ a). This transformation takes sets of any one of our four Borel 
fields in 2 X L into sets of the same Borel field and preserves #i-measure. We 
shall indicate the proof of this for @ X Land M @ &, the proof for the other two 
being the same. First, if M = M x (a, b) then obviously SM is a limit of finite 
sums of such sets. Since S takes sums into sums and complements into comple- 
ments it follows that S takes sets of M@ X & into sets of ML. The t-sets Mvp.2) 
and (SM).p,.) are obviously congruent (for all (P, x)) and so have the same 
tmeasure. Hence, by Fubini’s theorem M and SM have the same fi-measure. 
Because S preserves the measure of sets in I X L it must take sets of M @ & 
into sets of M @ &. 

Now let M « It, M C H, and consider the set *M defined by 


*M = (Ss. [EA eR CEO 58 <q 0S <4) 


Becaus 7’, is a measurable flow and S is a measure preserving transformation in 
2X L,’M «It @ & We shall call this *M the associated set of M. It has the 
following readily established properties: 


(2.3) for0 Sz <ce: *h, = M 
*M' = H [ PeM)].” 


(P,2) 


(2.4) forO St<e: 


We now prove a sequence of assertions which will show that Jt C M’. We 
number them for convenience. 

1) If M e M and *M is its associated set then for almost all t, *M* is in M’ 
and for almost all z, M, is in M. 

Let if « Ti; then for almost all t, *M is #%-measurable (by Fubini’s theorem) 
and for all ¢ it is a box; hence *M*‘ ¢ Mt’ for almost all ¢. If ¢ is such that +m’ 
¢ M’ then it follows from (2.4) that M, « 2.% Hence M, « M for almost all z. 

We now define §t to be the collection of all sets M included in 


H= [0s2<c-[0st<d¢. 
(P,.2,t) (Pitt) 





* M1, denotes the P-set: [ (P, t) eM). 
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and belonging to J @ L which have the property that for almost all x, 7, « y’. 

2) If MC A and M eM x L then M e M. 

To prove this it is sufficient,—since Jt is obviously a Borel field,” —to show 
that M contains all sets of the form M x (a, b) where Me M, so consider such a 
set. We know from 1) that for almost all z, M, belongs to Mt. Now (M x 
(a, b))z = Mz X (a, b) which is in M’ for almost all z. 

3) If M eM (and M C @) then 


m(M) = [ m(M') dt = e[ m(M,) dz. 


This is a consequence of Fubini’s theorem and the fact that m(M') = cm(M 1). 

4) If N « M (and N C A) and has m-measure 0 then m(N.) = 0 for almost 
all x in (0, c). 

If N has 7i-measure 0 then 


, [ T.(P, 2) ¢ N] 


has m-measure 0 (by Fubini’s theorem and the fact that 7 is measurable). 
Hence the associated set, *’, of N has fi-measure0. Then from 


0 = m(*N) =c [ m(N,) de, 


we see that NV, has m-measure 0 for almost all x in (0, c). 

5) If M « M (and M Cc #M) and m(M) = 0 then m(M,) = 0 for almost all 
zx in (0, c). 

By definition we know that M, « Dt’ except for an x-set (which we call 6) of 
Lebesgue measure 0. Also Mv,z) has t-measure 0 except for a (P, x)-set in M 
of 7-measure 0. Denote this exceptional (P, x)-set by N. Then, by 4), except 
for an z-set (which we call 62) of Lebesgue measure 0 we have m(N,) = 0. We 
shall now show that if x¢ 0, + 62 then m(M,) = 0. Consider such an x. Then 
Mz « M’. Then N, has m-measure 0 and for P outside Nz we know that 
m(M p,2)) = 0. Hence, applying Fubini’s theorem to the set M, (using the fact 
that it is in 2%’ and has ¢-measure 0 for almost all P) we see that m(M.) = 0. 

6) If Mc Aand M eM @ Lthen M eM. 

We already (by 2)) know this to be true when M « M X &, so it will be suffi- 
cient to show that if M « M Xx 2 and m(M) = 0 and if N C M then N e&. 
This is true because we know by 2) and 5) that for such an M we have M, « M’ 
and m(M,) = 0 for almost all a, which implies that Nj. « M’ for almost all z. 

We can now conclude our proof. Let M e M, Mca. Then the associated 
set *M eM @ Wand * Mc Ai; hence by 6) Me wT, so *M, ¢ M’ for almost all 2 
in (0, c). Since *M, = M far all x in (0, c) this means that M « 2’. Thus 
M < M’ and hence M = M’. 





7 T.e. M is a Borel field when only sets within # are considered and complements are 
taken with respect to 7. 
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TuzoreM 2. Every measurable ergodic flow is isomorphic to a flow built under 
a function. 

Proor: Let S; be the flow, Q* the space on which it occurs, and m*(M*) the 
measure on Q*. We want to find three things,—a space Q, a transformation 7’, 
and a function f(P),—such that the flow 7, defined in terms of them by (1.1) 
is isomorphic to the original flow. Then we shall be able to apply Theorem 1 
to conclude that 7’, is built under a function, i.e. that the measure involved is a 
direct product measure. The 2 we choose will be a certain subset of 2* obtained 
by a judicious picking of sequences of points along the trajectories of S,, the 
transformation 7’ on Q will be the one taking each point of Q into the next point 
along the trajectory (under S,) which lies in Q, and the f(P) will be the length of 
time (measured in terms of S;) it takes to move (under S,) from P to TP. 

Let M* be an m-measurable set with 0 < m*(M*) < m*(Q*) and let 9(P*) 
be its characteristic function. For P* fixed and outside a certain invariant 
(under S,) set Ny, of m*-measure 0 y(S,P*) is a measurable ¢-function. By a 
theorem of Wiener” we know that 


I [ o(S.P*) dt > (P*) feve®, 
€ 


for almost all P*. Hence (applying Egoroff’s theorem) it is possible to choose 
a number, a, with 0 S a < 1, such that for 6(P*) defined by 


a(P*) = : [ o(S.P*) dt® 


the P*-sets 
Mi = [a(P*) <2], Mr = [a(P) >¥] 


both have positive m*-measure. Also, for P* ¢ N. Y the ¢-function 6(S,P*) is 
continuous. In fact it is obvious that, for P* ¢ NV t , 


(2.5) | @(S,P*) — (8,P*) | < (2/a)|t — s!. 


Applying the ergodic theorem we know that for P* fixed and outside a certain 
invariant set No of m*-measure 0 the trajectory S,P* will have points in common 
with each of My and Mg for arbitrarily large and negatively arbitrarily large t. 
Now we write Qf = Q* — [Ni + Ne] and QF = Nt + Nz; then Of and OF are 
m*-measurable invariant sets and m*(Q2 ) = 0. We define 2 by 


Q = OF.[(P*) = 4 and (S8,P*) > } for all tin 0 <t  a/8). 


For the moment we shall not prove that © is an m*-measurable set; this follows 


from the fact, proved below, that G(P) is an m-measurable function. 
ENE 

* Wiener, XI, Theorem III’, p. 2. 

* #(P*) is obviously m*-measurable. Throughout this proof a is a fixed number. 
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Now we show that for any P* in OF the trajectory S,P* has points in common 
with Q for arbitrarily large ¢; the same proof shows this to be true also for arbi- 
trarily negatively large ¢. If f) is any positive number | then there exist t; and 
tg such that & < 4 < t and S;,P* « MY , Si,P* € M:. Because (S,P*) is 
continuous in ¢ there exists a ¢ in (4, &) for which ®(S,P*) = 4. We choose 
t' to be the greatest such ¢in (4; , 2). Then S,P* eQ because ©(S,P*) = 3 while 
(2.5) implies 


| 2 — t’| 2 (a/2)| S,, P*) — (Sy P*)| > a/8, 


and hence that 6(8,S,P*) > 3 for allt in0 < ¢ S a/8. 

We define 7 and f(P) as follows: if P ¢ Q then there is a smallest positive num- 
ber, t; , for which S;,P € 2.” Then TP = S,,P and f(P) = t;. We note that 
f(P) > a/8 for all P in Q. 

Now consider the region @ under the graph of f(P)”, i.e. the set of points (P, z) 
for which 0 S x < f(P), and let 7; be defined on & by (1.1). We establish a 
1:1correspondence, R, between QF and @ as follows: if (P, x) ¢& then P* = S,P 
is its corresponding point in Qi. This is obviously a 1:1 correspondence which 
carries S; into 7,, ie. if P* = R(P, x) then S,P* = RTP, zx) for allt. In 
other words, S,P = RT RP for all Pin &. Now let m(M) be the measure on 
& carried over from Q} by R.” If we knew m-measure to be the direct product 
measure of a measure on Q (for which 7’ was a measure preserving transformation 
and f(P) a measurable function) with Lebesgue measure on the z-axis then our 
theorem would be proved. To show this it is sufficient, by Theorem 1, to show 
that the functions F(P) and G(P) are 7i-measurable. 

To show that G(P) is m-measurable let b be any number = 0; then 


RIG) <b] = ie. SQ, 


so we must show the set on the right side to be m*-measurable. This is shown 
by the following equalities (in these formulae some of the indices range over a 
whole interval; we adopt the following convention: when such a sum or a product 
is primed then the summation or product is to be taken over all rational numbers 
in the interval plus the end points of the interval): 


> sa 


. OStsb 


pe ae, Sll@P*) = }][6(S,P*) > 4 for0 < s < a/8}} 





3° That there exists a positive number ¢; with S :P e & follows from the last paragraph; 
there will be a smallest one (and it will be > (a/8)) because if P ¢ @ then none of the points 
S,P, for0 < t S (a/8), arein Q. 

*! In this discussion we sometimes consider 2 as a subspace of Q* and sometimes as a 
space by itself. 

%2 T.e. a set M is m-measurable if and only if its correspondent in % is m * measurable 
and its measure, m(M), is the m*-measure of that correspondent. 
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{[@(S_.P*) = 3][@(S..P*) > 3 for0 < s < a/8}} 


<b 


Fs Me 


ai 


’ {{|@(S.P*) — 4| < 1/n][@(S..P*) > 3 for0 < s < a/8}} 
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mm, 


tl 
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1] 
4. 


’ [| @(S.P*) — 3| < 1/n] II [®(S,.,P*) > 4 forl/kSss a/s}} 


Ostsb 
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lA 
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for l/kSss a/sh} 


=I] > m, {16S Ps) — $| < 1/n] II > I’ wir) 23+ i/m}, 
n=1 0<t<b 1 m=1 1/k<s<sa/8 
which is pene an m*-measurable set. 
To see that F(P) is m-measurable we note that 
[F(P) > b] = 2 Ta[G(P) 2 ). x 
P O<t<b — 

We remarked in the introduction that Theorem 2 implies that, except for a 
trivial kind of exception, the trajectories in a measurable flow are always meas- 
urable sets. We shall now prove this. If S; is any measurable flow on a spaces 
0* then we can subdivide 0* into disjoint m*-measurable invariant sets: 


=O +O +--+ OR 4---, 


(it may be that a finite or an infinite number of the 0% are empty; in fact for an 
ergodic flow all but one will be empty) in such a way that: lhifn2 J then either 
0% is empty or else m*(2%) > 0 and S; i is ergodic on 0% , 2) either 23 is empty or 
else S, is completely non-ergodic on 9} , i.e. if M* is any measurable invariant 
set of positive measure then M* cottaliiie an invariant set, N*, for which m*(M*) 
> m*(N*) > 0. We see that every trajectory lying in 9 is measurable as fol- 
lows: for each ¢ > 0 it is clear that 2) can be divided into a finite number of dis- 
joint measurable invariant sets (whose sum is 2)) each of measure < «. Each 
trajectory i in 2 will lie in one of these sets. Hence each trajectory in 9} lies 
in some set of measure < ¢, and this for arbitrary positive «. Consequently 
each trajectory in 2) lies in a set of measure 0 and therefore. (since we always 
assume our measures to be completed) each trajectory in 2) is a measurable 
set of measure 0. 

Since on each Q% (for n = 1) which is non-empty S, is ergodic (and since we 
took care of 25 in the last paragraph) the matter of the measurability of the tra- 
lectories will be cleared up completely if we can clear it up for ergodic flows. 
By Theorem 2 it will be sufficient (in considering ergodic flows) to consider flows 
built under a function. Let S; be such a flow, on a region &, and let Q, 7, f(P), 
iM), m(M) have their usual meanings. Clearly the ergodicity of S, implies that 
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T is ergodic on. We now distinguish three cases: 1) every point in Q is m-meas- 
urable and of m-measure 0, 2) every point in 2 is m-measurable but some point 
has positive measure, 3) some point in Q fails to be m-measurable. First we 
consider case 1) (the usual case). Let S,P be any trajectory; this trajectory 
intersects Q in a sequence of points P, = (P,,0). Then this trajectory is the 
set 








[ P=P,] 


n=—o (P,2) 






























ea which is #-measurable and of #-measure 0, since our measure is a direct product 
tet measure. In case 2) the ergodicity of T implies that © consists of a finite number 
thy of points of equal positive m-measure plus perhaps a set of measure 0. Then S, 
is obviously isomorphic to the well known periodic flow on the circle (Siz = 1 +t 
mod a, a > 0). In this case all the trajectories are m-measurable sets but a 
certain one of them has positive measure while all others have measure 0. In 
case 3) it is clear (since m-measure is completed) that the m-measurable sets will 
contain a “chunk,” i.e. an m-measurable set, M, of positive m-measure and 
such that if N is m-measurable and contained in M then either m(N) = m(M) 
or m(N) = 0. The ergodicity of T implies that M has a finite number of trans- 
forms and that every m-measurable set of positive m-measure differs by at most 
a set of m-measure 0 from some finite sum of transforms of M. Then our flow 
*is essentially the same as in case 2) except that instead of all the measure being 
concentrated on the trajectory swept out by a single point all the measure is 
concentrated on the set swept out by M; clearly every 7-measurable set must be 
(to within a set of 7-measure 0) of the form: 


> S.M, 


tek \ 


 -— —”)h << 


where £ is any Lebesgue measurable ¢-set on some finite interval (0, a). (Ob- 
viously in this case it is possible to set up a measure preserving set-correspondence 
between this and a flow of case 2) so in a certain sense this is the same as case 2)). 
In this case there will be at least one non-measurable trajectory inside the set 
swept out by M. This is the only case in which there can be any non-measurable 
trajectories and this case is obviously of not the slightest interest. 

To summarize the results of this discussion we may say that if 7; is any 
measurable flow then the space on which it occurs can be split into a completely 
non-ergodic part, in which all trajectories have measure 0, plus a denumerable f 
number of ergodic parts (where a finite or infinite number of these parts may be | ¢ 
empty) and that on each ergodic part there are three possibilities: 1) each 
trajectory may be measurable and of measure 0, 2) each trajectory may be 


measurable but with one of the trajectories (necessarily a periodic one) having h 
positive measure, in which case the flow is isomorphic to a rotation on the circle, 0 
3) all the measure may be concentrated on the set swept out by a single set, M, tl 


of measure 0 in such a way that the only m-measurable sets are sums of trans- 
forms of M; in this case non-measurable trajectories are possible, but it is possible 
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to establish a measure preserving set correspondence between such a flow and 
a rotation on the circle. 


3. The group U; and a second representation theorem 


Let S; be a flow on a space * and let L2 be the space of (complex-valued) 
functions g(P*) of integrable square on 2*. We define the operator U, (for all 
real t) by 
(3.1) Ug(P*) = 9(S.P*).” 


Obviously the U; are unitary operators and form a 1-parameter group: U.U, = 
Us. It is known that if U; is a continuous ¢function (in the sense that, for 
any g in Lz, || Ug — g||—> 0 as t + 0) then U, has a spectral resolution, i.e. 
there exists a unique spectral family EZ, for which 


vid [ ea, ™ 
and it has been shown by Doob (III, Theorem 7) that if U, is defined by (3.1), 
where S; is a measurable flow, then U; is a continuous t-function.” Hence U, 
will have a spectral resolution in all cases we are considering. 

We now define a special kind of flow built under a function and show that a 
measurable ergodic flow is isomorphic to a flow of this kind if and only if the 
group U, has an eigenfunction of eigenvalue ~ 0. This result was proved,— 
except for the measure theoretic considerations,—by G. D. Birkhoff (II, p. 144) 
and our proof consists essentially in using his argument and then applying 
Theorem 2 to take care of these measure-theoretic considerations. 

Derinition 9. An eigenfunction of the group U; is a function ¥(P*) in L» 
which is different from 0 on some set of positive measure and such that for some 
real number A, 


Uw te) ey 
(for all real ¢). The number ) is called the eigenvalue of yp. 
DeFIniTI0on 10. If a flow is built under a function f(P), and f(P) is a con- 


stant function” then we say the flow is well built. The number f(P) is called the 
height of the flow. 





* The introduction of this group of unitary operators and the idea of attempting to 
characterize the properties of a flow in terms of properties of the spectral resolution of 
this group of operators is due to B. O. Koopman, V. 

“See VII and IX. 

_ SIfS-is a measurable flow and U, is defined by (3.1) then (by Fubini’s theorem) (Uf, g) 
‘$a measurable ¢-function for each f and g in L: and Stone (VII) and von Neumann (IX) 
have shown that in case Lz is separable this implies that U, is a continuous ¢t-function. In 
our case, however, where no separability conditions are assumed, it is necessary to use 
the deeper result of Doob to prove that U, is a continuous t-function. We remark that 
for a measurable ergodic flow this continuity follows easily from Theorem 2. 

* By constant we mean constant everywhere. 
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THEorEM 3. A measurable ergodic flow is isomorphic to a well built flow if and 
only if its corresponding U, group has an eigenfunction of eigenvalue ~ 0. The 
heights of the well built flows to which it is isomorphic are the numbers 2x/\ ) |, 
where d is an eigenvalue of the U;. 

Proor: If 7; is a well built flow of height a and if is defined by 2/)\ = 
then obviously the function ¥(P, x) defined by 

W(P, x) = 
is an eigenfunction of eigenvalue \ for the corresponding U;. Since the eigen- 
values form (for an ergodic flow) an additive group of real numbers” —) is also 
an eigenvalue. Hence if a measurable ergodic flow is isomorphic to a well built 
flow of height 27/ | \ | then both \ and —) are eigenvalues of the corresponding 
group U;. 

Now let S; be a measurable ergodic flow on a space 2* and let the corresponding 
U, have an eigenfunction y of eigenvalue \, (A #0). Then yis an eigenfunction 
of eigenvalue —\.” One of \ and —) is positive; we suppose is positive and 
from now on work with \ and y (if —\ were the positive one we would work in 
the same way, but with —\ and ~). Because y is an eigenfunction we know 
that for each t we have 

W(S:P*) = ey (P*), 

for almost all P*. By changing ¥(P*) at most on a P*-set of measure 0 we can 
make this equality hold for all P* and t;* we suppose y to be so altered. Now 
|y | is an invariant function (i.e. | ¥(S,P*) | = | ¥(P*) |) and hence (since 8S, 
is ergodic) | ¥(P*) | = constant = k > 0 except for P* in some m*-measurable 
set, N*, of m*-measure 0. Because ¥(S,P*) = e'y(P*) for all P* and t we 
see that N* is an invariant set. Now writeQ? = N* and Q; = 0* — N*. Then 
define 2 by 


a = oF.[y(P*) = K] 


(where k i is the constant mentioned above). We establish a 1:1 correspondence 
between Q} and 2 X (0, 27/2) as follows: if (P, x) isin Q X (0, 24/d) then make 
correspond to it the point P* in Qf defined by P* = S,P. We define the trans- 
formation T on 2 by TP = S2:,~P. Then this correspondence obviously carries 
S, on Qf into that flow on 2 X (0, 27/X) which is built on the transformation 
Ton. Asin the proof of Theorem 2 this correspondence gives rise to a measure, 
m(M), on 2 M (0, 2m/d) and the only thing remaining to be shown is that this 
measure is the desired sort of product measure. To prove this it is sufficient, by 
Theorem 1, to show that the functions F(P) and G(P) defined by (2.1) and (2.2) 
are mi-measurable. Now G(P) is m-measurable because the set in 2} correspond- 
ing to the P-set for which G(P) < b is the P*-set for which arg ¥(P*) < bd. 
F(P) is m-measurable because it is a constant. 





37 See X, p. 625. 
3 See IV, Lemma 9.1, p. 27. 
* This remark is due to Koopman, V, p. 318. 
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To conclude we point out that our first representation theorem can be used to. 
generalize a theorem of von Neumann’s about the spectral family E, associated 
with the group U,. Let € be the closed linear manifold spanned by the eigen- 
functions of the group U;. Then &, is still a spectral family when considered 
only on Lz — , the orthogonal complement of €“. The set of points of in- 
crease” of this spectral family when it is considered only on L; — € is called the 


band spectrum of the original spectral family. 

TuzoreM 4.” For a measurable ergodic flow the band spectrum is either the null 
set or the whole real line, 

Proor: We shall not give a detailed proof of this theorem. We merely remark 
that von Neumann proved this assuming the flow to be differentiable in order to 
find,—in case the flow was not rotation on the circle,—cross sections whose 
translations over large time intervals were all disjoint. Using our representation 
theorem (and an unpublished but easily proved lemma of P. R. Halmos to the 
effect that on a space containing sets of arbitrarily small measure there exists, for 
every ergodic measure preserving transformation, 7', and positive integer, n, 
a measurable set of positive measure whose first » transforms by T are all dis- 
joint) we are able,—in case the flow is not rotation on the circle,—to obtain such 
cross sections without any differentiability assumptions. As soon as this is 
noticed von. Neumann’s proof can be carried through exactly as in the differen- 
tiable case. 
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“See VIII, Theorem 5.13, p. 189. 

“ See VIII, p. 184, 

“See X, Theorem 5, p. 635. 

“ It has been pointed out to me by Dr. P. R. Halmos that this proof is not correct as it 
stands, but it has also been pointed out to me by Prof. 8. Kakutani that it is possible to 
make it correct by some minor alterations in the definition of the function ¢(P). 

















nae 
RAS a - 


” ve at ; 
eo’, , 
ee aes 























ANNALS OF MATHEMATICS 
Vol. 42, No. 3, July, 1941 


ON THE EXPONENTS OF DIFFERENTIAL IDEALS 
By Exuis Ropert Ko.cHin 
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INTRODUCTION 


In the theory of polynomial ideals, in algebra, there are methods, stemming 
from the theorem of M. Noether, and associated with the names of E. Bertini, 
E. Lasker, F. 8. Macauley, K. Hentzelt, H. Kapferer and P. Dubreil, for finding 


‘the exponent of an ideal, or at least a bound for the exponent. 


~ When one seeks to create a notion of exponent for ideals of differential poly- 
nomials, one is forced, because of a situation revealed by H. W. Raudenbush,’ 
to admit infinite exponents as well as finite ones. The investigation of such 
exponents, finite or infinite, to some extent for differential ideals of a general 
character, and to a deeper extent for differential ideals generated by a form in 


one unknown of the first order, is the object of the present paper. 


If we may refer to §2 below for a definition of the exponent of a differential 
ideal, we shall proceed to enumerate our results. 

Part II, which presents what is possibly the most interesting portion of our 
work, deals with differential ideals generated by a single form A, in one un- 
known, of the first order. The concept of multiplicity of a singular solution of A 
is introduced (§6) and it is shown (§7) that if A has a singular solution of multi- 
plicity exceeding unity, then the differential ideal generated by A has exponent 
infinity. Forms A which have singular solutions, all of multiplicity unity, are 
discussed in §§8-10. Such singular solutions are divided into two classes, and, 
guided by a general theorem due to J. F. Ritt, we secure a decomposition of the 
differential ideal generated by A which puts these two classes into evidence 
(§8). In §10 it is proved, under an additional assumption (regular type) that 
the exponent of the differential ideal generated by A is unity or two according as all 
the singular solutions are in the first class or at least one singular solution is in the 
second class. The differential ideal generated by a form A which has no singular 
solutions, is shown, under a certain additional assumption, to have exponent 
unity (§11). Part IL-eoncludes with a discussion of a type of form which we 
call hyperelliptic. The “intermediate ideals” are found, and they are shown to 
fall into chidins’6f'4 fixed length. 

Part III contains a brief discussion of chains of differential ideals. A theorem 
is proved which gives a bound for the exponent of a so-called principal chain 
in terms of the length of the chain (§14). 

Part I begins with a statement, in abstract form, of a decomposition theorem 
due to Ritt. After the definitions of relative exponent and other terms, there 





? Bulletin of the American Mathematical Society, vol. 42 (1936), pp. 371-373. 
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is proved a theorem connecting the relative exponent of a differential ideal with 
those of the factor ideals in the decomposition just mentioned (Theorem 1). 
Also, a result is presented which relates the existence of a strong basis in a 
differential ideal to the existence of such a basis in the factor ideals (Theorem 2). 
The remainder of Part I deals with special questions whose treatment is im- 
portant for the sequel. 

In Part I, use is made of the theory of resolvents. This theory, as developed 
by Ritt, applies to a differential field whose elements are meromorphic func- 
tions. To permit the use of abstract fields, there is given, in an appendix, a 
discussion which supplements Ritt’s proofs, making them valid in any differential 
field of characteristic zero. 


Notation 


Differentiation (of elements in a differential ring’) is indicated by subscripts; 
thus a; means the j-th derivative of a. If a single letter is used with different 
subscripts to denote different ring elements, then differentiation is indicated by a. 
second subscript; thus, the j-th derivative of a; is ai; , 

Square brackets [], curled brackets { }, and parentheses (), when not used 
as symbols of aggregation, will mean, respectively, the differential ideal, the 
perfect differential ideal, and the (ordinary, algebraic) ideal generated by the 
set of elements they include. These various ideals are supposed to be formed 
in a fixed differential ring which underlies the discussion. When forming an 
(algebraic) ideal, the differential ring is considered as an (algebraic) ring. 

Membership in a set is denoted in the usual way by the symbol «. Set inclu- 
sion is indicated by &, proper inclusion by C. The symbol for the intersection 
of sets is f. 

The notations 


a=b _ (m,n,---), 
a=b [m, n, --+ ], 
a=b_ {m,n,---} 
will mean, respectively, 
a—be(m,n,---), a — be[m,n, --- ], a — be{m,n,--- }. 


The product of a finite number of differential ideals is defined as their product 
when considered as (algebraic) ideals. This is readily seen to be a differential 
ideal. If o:, o2, +++, 0, are differential ideals, their product is denoted by 
0102 +++ Os. 


If o is a perfect differential ideal in a differential ring D, and if de D, then 





_ * The definitions of differential ring, differential ideal, and other terms, are to be found 
ina paper by Raudenbush, Transactions of the American Mathematical Society, vol. 36 
(1934), pp. 361-368. 



























































































742 ELLIS ROBERT KOLCHIN 


a:d, the quotient of o by d, is defined as in algebra to be the totality of elements 
ae® such that adeo. o:d is itself a perfect differential ideal. 

If § is a differential field, then §{u, --- } means the differential ring obtained 
by the differential ring adjunction of u,---. The result of differential field 
adjunction will be denoted by §(u, --- ). 

As is customary, the word form will be used as an abbreviation for differential 
polynomial. 

By a solution of a set of forms in §{y1, --- , yn}, is meant a set of elements 


-™, +++, Of some differential extension field of §, which annul the forms of 


the set when substituted for yi, --- , Yn, respectively. 
Part I. Some GENERAL THEOREMS 


1. The product representation 


Consider a differential domain of integrity © which contains the rational 


‘numbers. We suppose that every infinite system in D has a basis. The set of 


forms in a fixed finite number of unknowns, with coefficients in an underlying 
differential field of characteristic zero, is such a differential domain of integrity.’ 

Aset of differential ideals in D will be called separated if 1 =0 (01 , o2) for each 
pair of distinct ideals o; , o2 in the set. 

A differential ideal will be said to be connected if it is not the intersection of 
two separated differential ideals. Since the two ideals are required to be 
separated, “intersection” may be replaced by “product.’” 

A theorem due to Ritt may be formulated abstractly as follows: 

Let o be a differential ideal in D. Then o is connected if and only if {c} 1s. 


‘If o is not connected, it has a representation as the intersection (= product) of a 


separated finite set of connected differential ideals. This representation is unique. 
If o1o2 --- o, is the representation of a, then {o1}{o2} --- {os} is the representation 
for {o}.° 

We shall call the representation described in this theorem the product repre- 
sentation of o. Each o; will be called a factor of the product representation. 


2. Exponents 


Lema 1: Let o and r be differential ideals in D such that o Gr & {co}. Then 
the product representation of « has the same number of factors as that of +, and, with 
a suitable assignment of subscripts, we may write for these representations 


o = 0102 +++ Gs, T =. 7172 +4 *. Te, 
0: & 73 & {oi}, te I, +++ 8 


Proor: The theorem of §1 shows that we may write for the product repre- 
sentations 


(1) 





3 See the paper cited under footnote 2. 
‘See B. L. van der Waerden, Moderne Algebra, vol. 2, Berlin, 1931, p. 46. 
5 Ritt, Proceedings of the National Academy of Sciences, vol. 25 (1939), pp. 90-91. 
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o = 0102 --* Gs, FEU PEre MAS Fe. 
{oi} = {7}, $=], ..-,8 
Here product is the same as intersection, and o © 7 € 71, so that 
c=(1Nn)NaN.--No = (aN n)or--- 0. 


Since the product representation is unique, this implies that 0: = o; M 71, that 
iso, © 71. Similarly, every o; © 7:. 

Consider two differential ideals ¢ and 7 in D. If there is a positive integer q 
such that r+‘ & a, let p be the least such q; if no such g exists, let p = ©. We 
shall call p the exponent of o with respect to r, and shall denote it by lc. The 
exponent of o with respect to {o} will be called, simply, the exponent of o. 

In discussing l,¢ we limit ourselves to cases for which ¢ © 7 © {eo}. If 
r¢ {o}, then lo = ©; if o & 7, we may let 7’ = (co, 7), and then o & 7’, 
Lo = lo. 

TuzoreM 1: Let o and 7 be differential ideals in D such that o © 7 © {co}. 
Let the product representations of o and r be given by (1). Then 

Lo = max l,,0;. 
i=1,°++,8 

Proor: Let p = max l,,0:, gq = Lo. If p = ~, then g S p. Suppose 
p<, Then 


Tr” = (1 -++ me)” = thee 12 G1 OF =, 


so that in either case g S p. It remains to prove that p S q. 
Ifg = », then p S g. Supposeg < . Then 


oi --- of = (a, +--+ o,)* = coc T= T1+** Ts. 


But o{,---,0; are obviously the factors of the product representation of 
oi++-o;. Hence, by Lemma 1, of G 7;, 1 = 1, --- , 8, that is, p S g. 
3. Strong bases 


In this section we present a theorem which will not be used in the rest of the 
paper, but which may be of some interest on its own account. 

Let ¢ be a differential ideal in D. If o has a strong’ basis b,, --- , bs, let 
q = l.[b: , --+ , bs); of all integers so obtained, let p be the minimum. If o has 
no strong basis, let p = ©. We shall call p the basis index of o. 





*It is easy to see that 0,171,062, ---,0, are separated. For example, because 
1,02, °+- , o, are separated, there are elements h; € 0; and hz ¢ a2 such that 1 = hi + he. 
Choosing, as we may, p large enough so that he e7,, we have hg eo, 1) 71, phahe + 
“+ hE eop and 1 = he + phe-the + --- + hp. 

"A basis of o is said to be strong if there is a single integer such that g‘ is in the differ- 
ential ideal generated by the basis, for every g¢o. There exist differential ideals which 
do not have a strong basis. See Theorem 5 below; also, Kolchin, Bulletin of the American 
Mathematical Society, vol. 45 (1939), pp. 923-926. 
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TuEorEM 2: Let the differential ideal « in D have the product representation 
o = 0,--+o,. Then the basis index of o equals the maximum of the basis indices 
of the o;. 

Proor: We know that there exist elements m;, with derivative ma ¢¢, such 
that o; = (c, m;),i = 1,---,8.° Let bi, --- , b, be a strong basis of o, with 
I.{bi, +--+, ba] = p. Then 


a? = (a, m)” & (0, mi) & [bi , --- , ba, mi. 
Thus, for each z, b; , --- , bx, m;is astrong basis for o; , and /,,[b: , --- , bx, mi] < 
p. Hence the basis index of o is not less than that of each o;. Now let 
bf”, coe, by? be a strong basis of Oi, and let qd P-4 1,,[b1”, ante bi? |, t= 1, “09 9G 
Since o; = (o, m;), we may write, for each 7 and j, 


ore So Be 
bf? = tf? + ¢}?m, tf” €9. 


Then 


c= (o1 +++ o)* = of «++ of 


() (1) (s) (s) 
CS [br , «++ , bay vee [BI +, OR] 
1) 1) ) 
& (ef, 8, --) ho, ma, oo | Man, Mh +++ Mel. 


Thus, o has a strong basis whose associated exponent is less than or equal to q, 
so that the basis index of ¢ is not more than the greatest basis index of the o;. 


4. Invariance of the exponent under differential field adjunction 


In this section we specialize D to a differential ring of forms. 

Let § be a differential field of characteristic zero. Let §’ be a differential 
extension field of §: § €& §’. Throughout the present section y;, ui, w will 
denote unknowns. ~ 

‘If © is a set of forms with coefficients in §, we shall use (&), [®], {} to denote 
the algebraic, the differential, and the perfect differential ideals, respectively, 
generated by © in the differential ring of forms with coefficients in §. If isa 
set of forms with coefficients in §’, the algebraic, the differential, and the perfect 
differential ideals generated by @ in the differential ring of forms with coefficients 
in § will be indicated, respectively, by (€)’, []’, and {®}'. It is clear that if 
2 is a differential ideal of forms with coefficients in §, then (2)’ = [2]’. 

THEorEM 3: Let 2 and A be differential ideals in F{yi, --+ , yn}. Then 


lad = Ic aye(Z)’. 


Proor: Let p = 1,2, q = kay-(Z)’. Ifg = ©, thenp Sq. Supposeg < %, 
and let GeA. Then Ge(A)’, so that G%e(z)’. But Ge Fly, -++ 5 Yni- 
Hence G* ¢ 2,’ so that p < q. 





8 Ritt, loc. cit. under footnote 5. 
® See proof in van der Waerden, loc. cit., p. 67. 
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Ifp = ©, theng S p. Suppose p < o, and let Ge(A)’. Then there is a 
relation 


G= > 4,G,, ial G,¢ A. 


p=l 


ee DY . ait. Hiatt... Gir e (ays (ay, 


yt s+ t+in=p i! eee tn | 


so that g S p. This completes the proof. 

Theorem 3, by itself, yields no information as to how the exponent of a differ- 
ential ideal is affected by a differential field adjunction, for there is no obvious 
reason why {}’ should equal ({2})’. These two ideals, however, are identical. 
We devote the rest of this section to the proof of this fact. We use three lemmas. 

Lemma 2: Let Q be a prime differential ideal in §{u , +++ , Ug, W, Yr, +++ 5 Yoh, 
with basic set 
(2) A, A1,+++,Ap 


introducing in succession W, Y1,-+++,Yp. Let A be algebraically irreducible 
over §, and let each A; be of order zero and degree unity in ys Then (2) is a basic 
set for {Q}’. 

Proor: Let Bi, --- , Bs be the irreducible factors of A ‘over %’. Let S and 
I be the separant and initial, respectively, of A; let J; be the initial (= separant) 
of A;. Each B; is of the’same order (call it r) in w as A is, for otherwise A 
would be reducible over §. Let 7; be the separant, J; the initial, of B;. We 
have . 


Fe Jyi id; 
8 = TBR, EB NOR ee BB, --. 
Now, 2 = {A, A,,---,Ap}iISI,--- Ip. Let 
ihe AMAA ehh Manse Bosc ic) VM dy- ory @ 


Then each Q; is a prime differential ideal in §{m,--- , Ug, W, Yr, +++, Yp}e” 
Moreover, 





" Pnoor: Let CDe wh . For appropriate k we may write (J,---I,)*C = C’, 
‘+ Ip)*D = D' [A,, +--+ , Ap] where C’ and D’ are forms free of y1, --:, yp. Of course 
on €Q2;. We prove one a C’, D’ is divisible by B;. Suppose neither is. Then there 
are forms M and N, free of y:, +++, yp, such that E = M(C’D'T Ji], +++ Ip) + NBi, 
where E is a nonzero form free of y:,--+,¥p, of order less than r in w. Clearly 
Ee{B;, A1,-+++, Ap}. It readily follows that (TI; --- Ip)'E ¢(B;) for some positive 
integer J. Since B, is irreducible, this implies that E « (B;). But the order of E in w is 
less than r. Hence E = 0. This contradiction shows that either C’ or D’ is divisible by 
B;, say C’ is. Then C €Q;. Thus, Q; is prime. 
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{Q}’ {{A, Ai,-++,Ap}:ISh --+ Ip}’ 


ea Se ce 
<4 ({Bi, Ai, ++, Ap} A A {B., Ai, yiicies , A>} IS, .-- I 
= ({Bi, Ai, cea ,A,}/:ISI, ---I,)N--- 


N ({B,, A1,+++, Ap}’:ISh --- I,) 
= ({By; Ar, -:-; Ay}'tdi «TTB By te dD 
N.-- NAB, , Ary.+++ Ap} Sdace SoBe + +> TJ; --- I,) 
a (Ry 6 SBE BYR ORR EN Bee 
=9N..-NQ.” 


{Q}’ does not contain a foi of class g + 1 of order less than r in w, because 
Q; does not. Let C be of class g + 1 and have order r in w, and suppose that 
C « {Q}’. C €Q;, and is therefore divisible by B;, 7 = 1, --- ,s. Hence C is 
divisible by A. From this it follows that (2) is a basic set for {Q}’. 

Lemna 3: Let y contain non-constant elements.” Let = be a prime differential 
ideal in ${us,-++, Ue, Yi, +++, Yp}, with arbitrary unknowns uw, +--+ , tig.” 
Then (2)’ = {Z}’. 

Proor: It suffices to show that {Z}’ € (2)’. Let A = O be a re- 
solvent for 2, and let @ be the associated prime differential ideal in 
Olu, +++, Ug, W, Y1, +++, Yp}, with basic set (2), where A is of order r in w, 
and each A, is of order zero and degree unity in y; ."* We know that 


= ON Flu, -++ Ug, Yr, ++ 5 Ypt- 
Let Ge {Z}’. We may write 
G = Go + Gio + foe + Gnwm 


where each G, ¢ §{ui, --- , Ug, Yi, +--+, Yp}, and where 1, w1, --+ , @m are ele- 
ments of %’, linearly independent over §. We shall show that each G, ¢ 2. 

Let S and I be the separant and initial of A; let I; be the initial of A;. Evi- 
dently there exist non-negative integers a, b, a; such that 


(3) PSIz -.. 17G, = Ry), w= 0,1,-++5m, 
where each R, is reduced with respect to the basic set (2). Then 
(4) P°S'T} ... TPG = Ry + Ry + +++ + Resom(O). 
But, by Lemma 2, (2) is a basic set for {@}’. Hence the second member of (4), 





11 The parentheses in these equations are symbols of aggregation. 
12 An element of § is called constant, if its derivative vanishes. 

% The u; need not actually occur. ° 

14 See Appendix. 
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which is in {@}’, yet reduced with respect to (2), must vanish. Since 
1, wi, «++» @m are linearly independent over §%, it follows that each R, vanishes. 
(3) then shows that each G, «@. Since G, is also in F{w, +--+ , Uy, Yr, +++, Yo}, 
we have G, ¢ 2,4 = 0,1, --- ,m. 

Hence G « (2)’, so that {2}’ & (2)’. 

Lemma 4: Let § consist purely of constants; let %’ = §(x), where x is an element 
whose derivative is 1. Let = be a prime differential ideal in §{yi, --- , yn}. 
Then (2)’ = {Z}’. 

Proor: We must show that {2}’  (2)’. LetGe{Z}’. Fora suitable non- 
zero element ¢ ¢ (x), we may write 


0G = Ay + Aizt+---+ Ane”, 
where the H, are in §{y1,---,Yn}. For a large integer r, G’e(2)’. Hence 
Hi + rHo ‘Hit + --- + Haz € (2)’. 
Thus, we see that we may write 
Hy + rHo ‘Hix + eee + | | aa = dS, + --- + dS: , 
where each S;¢ 2, and each d; ¢ (xz). Let w, ---,w: be elements of §(z) 


such that 1, z,---,2”", w, --+ ,@: are linearly independent over §, and such 
that each d; is linearly dependent on 1, z, --- ,2”", w1,---+,@:. Then there 


is a relation 
Ho + rHp ‘Hit +--+ + Hn = To +) HT mt + Un + ++» + Un, 


where the 7; and U; are in 2. By the linear independence it follows that 
Hj = T), so that Hype 2. Hence 


Hi + Hee + --- + Het” € {2}’. 


Continuing, we find every H, «2. Therefore G ¢(2)’ and {2}’ € (2)’. 

We are now in a position to prove our theorem. § is again any differential 
field of characteristic zero, and §’ a differential extension thereof. 
, pig 4: Let 2 be a perfect differential ideal in ¥{yi,---, Yn}. Then 
z)' = {d}’. 

Proor: Let 2 = =f --- 1, be the decomposition of = into essential prime 
differential ideals. It is easy to see that 


(5) {x} = {ZV N... N {zd}. 
We shall prove that 
(6) (2)! = (2) N--- N (2). 


Indeed, since = € 3, , we have (2)’ € (2,)’,4 = 1, --- , 8, that is 
Cy ey 1. Nis 
Now let Ge (3,)’, i = 1,---,s8. We may write 
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{Q}" 


{{A, Ai, +--+, Ap}:DSH +--+ Ip}’ 
= {A,A,,---,Ap}’:ISh --- Ip 
= ({Bi, Ai, +++, Ap}'N--- N{B., Ar, +++, Ap}: TSH --- I, 
a (1B,, Ay, «++ ¢ del aay s+ Eadans 
n ({B, ’ Ai eget PL, A,}':ISI, Pecks I,) 

= ((By, Ay, -:- 5 Ap}’tdr > 3 TPB ee 

n eth n ({B., Ai, “Wis » Ap}: Ji A J.B, Be °° TI; -++ I,) 
= (Qyidy +++ SeBe +++ By N+ AO (Qytdi «++ Se aBy +++ Bia) 
= Q; Nn ¢ : : Nn Whe 

{Q}’ does not contain a form of class g + 1 of order less than r in w, because 
Q; does not. Let C be of class g + 1 and have order r in w, and suppose that 
C « {Q}’. C €Q;, and is therefore divisible by B;, 7 = 1,---,s. Hence C is 
divisible by A. From this it follows that (2) is a basic set for {Q}’. 

Lemma 3: Let § contain non-constant elements.” Let = be a prime differential 
ideal in ${ui, +++, Ug, Yry ++ Yn}, with arbitrary unknowns uw, --+ , U,.” 
Then (2)’ = {Z}’. 

Proor: It suffices to show that {2}’ € (2)’. Let A = 0O be a re- 
solvent for 2%, and let @ be the associated prime differential ideal in 


Blur, +--+ ,Uq, W, Yi, +--+, Yp}, With basic set (2), where A is of order r in wv, 
and each A, is of order zero and degree unity in y; .'* We know that 


x T= AN Flu, +--+, Ue, Yr, -+* y Yo}: 
Let Ge {Z}’. We may write 
G = G + Gis + eee + Gnwom 


where each G, ¢ F{wi,--- , Ug, Yi, --*» Yp}, and where 1, wi, --+ , m are ele- 
ments of %’, linearly independent over §. We shall show that each G, e 2. 

Let S and I be the separant and initial of A; let J; be the initial of A;. Evi- 
dently there exist non-negative integers a, b, a; such that 


(3) Sip ... 1G, = R,(Q), p= 0; 1,+++ 5m, 
where each R, is reduced with respect to the basic set (2). Then 
(4) PSY... 7G = Ryo + Ren + «++ + Rewom(O)’. 


But, by Lemma 2, (2) is a basic set for {2}’. Hence the second member of (4), 





11 The parentheses in these equations are symbols of aggregation. 
12 An element of § is called constant, if its derivative vanishes. 

% The u; need not actually occur. . 

14 See Appendix. 
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which is in {Q}’, yet reduced with respect to (2), must vanish. Since 
1,1, ++ » @m are linearly independent over %, it follows that each R, vanishes. 
(3) then shows that each G, ¢®. Since G, is also in F{w, --+ , Uy, Yr, +++, Yr}, 
we bave G, € 2, u = 0,1, --- ,m. 

Hence G ¢ (2)’, so that {2}’ & (2)’. 

Lemma 4: Let § consist purely of constants; let %’ = 3x), where x is an element 
whose derivative is 1. Let = be a prime differential ideal in §{y:, --- , yn}. 
Then (Z)' = {2}’. 

Proor: We must show that {2}’ € (2)’. Let Ge {2}’. For a suitable non- 
zero element ¢ ¢ (x), we may write 


eG = Ay + Mixt+--- + Ane”, 
where the H, are in ¥{yi,---,Yn}. For a large integer r, G’ e(2)’. Hence 
Hj + rHo (Hit + --- + Hie” € (2)’. 
Thus, we see that we may write 
Ho + rHo ‘Hix + --- + Haw” = dS, + --- + deSk, 


where each S;¢ 2, and each d;¢ §(x). Let w, ---,w: be elements of §(z) 
such that 1, z,---,2”", w1, --- ,@: are linearly independent over §, and such 
that each d; is linearly dependent on 1, z, --- , 2”, w1, ---,@:. Then there 
is a relation 


Ho + rHy ‘Hit + +++ + Aye™ = Ty + +++ + Tmet™ + Un +--+ + Un, 


where the 7; and U; are in 2. By the linear independence it follows that 
Hj = T,, so that Hoe >. Hence 


A, + Hex + see ao Ay” € {r}’. 


Continuing, we find every H, «2. Therefore G ¢(=)’ and {2}’ € (2)’. 

We are now in a position to prove our theorem. §% is again any differential 
field of characteristic zero, and §’ a differential extension thereof. 
| THEOREM 4: Let = be a perfect differential ideal in §{y1,---, Yn}. Then 
zy)! = {2}’. 

Proor: Let 2 = 2, / --- MZ, be the decomposition of = into essential prime 
differential ideals. It is easy to see that 


(5) {ry = {ay N... {ay}. 
We shall prove that 
(6) (Z)’ = (2,)'N.-- N(z,)’. 


Indeed, since > € 2; , we have (2)’ € (2,)’,7 = 1, --- , 8, that is 
(Z)’ & (2,)'N--- N (2,)’. 


Now let Ge (2,)', i = 1,---,8. We may write 
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G = Gi? + Gira + «+» + Pam, 


where each G{” ¢ Z;, and 1,1, --- , amare elements of §’, linearly independent 
over §. The linear independence shows that Gj” = ... = Gf,i =0,1,... ,m, 
so that each Gf? «2, N-.--N 2B, = 3, and Ge(z)’. Hence 


(2)’ D (2) N- + NB)’. 


This proves (6). 
If § contains non-constants, then, by (5), (6) and Lemma 3, {2}’ = (3)’. 
Suppose § consists purely of constants. Let §* = (x), where z is an ele- 
ment whose derivative is 1, and let §* = §’(z). By Lemma 4 and equations 
analogous to (5) and (6), 


(Z)* = (21)* N--- M (2,)* 
= {2i}*M--- M{2}* = {2}*. 
Hence, by the part of the theorem already proved, 
(z)* = ((2)*)? = ({2}*) 
= ({2}8}8 = (2H 
Thus, (2)* is a perfect differential ideal. 

Now, (2)* = ((2)’)'. Suppose that (=)’ * {2}’. Then there is an 
Fe S'{y:,--+, yn} such that Fe{z}’, Fe(z)’. Clearly Fe {z}*, that is, 
F e(=)* = ((2)’)*. Since F € $’{y1, --- , yn}, this implies that F ¢(Z)’. This 
contradiction shows that (2)’ = {2}’, and completes the proof. 

Corouary 1: Let = be a differential ideal in &{y1, +--+, yn}. Then the ex- 


ponent of = equals that of (2)’. 
Proor: By Theorems 3 and 4 


lx)2 = U¢2y)(Z)’ - Uy x)7(2)’ = Uyxy(2)’. 


5. A certain differential ideal 


Let § be any differential field of characteristic zero, and let y be an unknown. 
We work in the differential ring §{y}. The following theorem uses an idea 
due to Raudenbush.” 

TuroreM 5: [y]’ does not have a strong basis. 

Proor: Assume that [y]’ has a strong basis. Then it has a strong basis con- 
sisting purely of forms yy;. Let s be an integer so large that the set of forms 


(7) YY; , 0<:185j85, 


is a strong basis of [y]’. Let p be the exponent with respect to {y]’ of the differ- 
ential ideal generated by the basis (7). Let a be a positive integer whose value 
(large) is to be given later. 





16 See paper cited under footnote 1. 
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Consider the forms 


(8) YiYig *** Yiaps tere + lop = da. 


Each form (8) is in [yl’”. Considering degree and weight, we see that each form 
(8) is a linear combination, with coefficients in §, of forms 


(YY i)Yir --* Yirep-a» 
(9) Osisjss, tt7tkht~pat--- + hope =a. 


Hence the number of linearly independent forms (8) does not now exceed the 
number of forms (9). Since the totality of forms (8) are linearly independent, it 
follows that the number of distinct forms (8) is less than or equal to the num- 


ber of forms (9). 
Now, the number of expressions (8) is clearly at least 


1 ( mers 
sere ie 1) 
(2p)! \2p 
if we suppose that @ is divisible by 2p. For, we may assign 7, --~ , tp-1 arbi- 
trary values from 0 to a/2p, and then 7%, is uniquely determined; moreover, 


the number of times a given form (8) can be produced in this way is at most 
(2p)! On the other hand, the number of expressions (9) is surely not more than 


4(s + 1)(s + 2)(@ + 1)”™, 


for the number of forms in the basis (7) is (s + 1)(s + 2)/2. But if @ is suffi- 
ciently large, then 


1 a <i 2p—2 
ay (gti) > Het De + 20+ 0" 
Hence, the number of forms (8) exceeds the number of forms (9). This contra- 
diction completes the proof. 
Corottary 2: Let P « §{y} be of order zero. Then the exponent of [P] is © 
or 1 according as P has or has not a multiple factor. 
i Proor: By Corollary 1 we may suppose that § contains all the roots of 
= 0. Let 


P = by — m)" -+: Y — ™)", ni ¥ nif i ¥ Jj, 
be the factorization of P into linear factors. It is easy to see that 
(10) [P] = [(y — m)"] --- [y — 2)" 


is the product representation of [P]. If k; = 1, then [(y — »)"] = [y — nd 
obviously has exponent unity. If k; > 1, then [(y — ni)**] has exponent ~. 
For, otherwise (y — »,;)** would be a strong basis for [y — 2J°, a contradiction 
of Theorem 5. Since by Theorem 1 the exponent of [P] equals the maximum 
of the exponents of the [(y — ,)"*], the proof is complete. 
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Part IJ. Tue DirrerENTIAL IDEAL GENERATED BY A Form In ONE Unknown 
or OrpEeR UNITY 


6. Preliminary remarks 


Consider a form A in a single unknown y, of order unity. We seek the ex- 
ponent of [A]. We shall work in a fixed differential field § of characteristic 
zero, which contains the coefficients of A. By Corollary 1, we may (and do) 
assume, without further mention, that § is extensive enough to contain certain 
solutions of A which we shall discuss. 

The exponent of [A] will be found to depend on the nature of the singular 
solutions of A, that is, the solutions of A which annul S, the separant of A.° 
A singular solution of A must annul the resultant with respect to y; , the deriva- 
tive of y, of A and S. This resultant is a nonzero form of order zero, or is 0, 
according as A has no multiple factor of order unity, or has such a factor, that 
is, according as A and S have not or have a common factor of order unity. In 
the former case, A has only a finite number of singular solutions, and these are 
easily found. 

If y = 7 is a singular solution of A, we consider A as a form in z = y — 9, 
and write 


(11) A=DP2z, 
v=0 


where each P, either is of order zero and not divisible by z, or vanishes, and 
where the p, are definite non-negative integers.” We suppose that the degree 
of A in y; is n, so that P, ¥ 0. 

Let m be the minimum of the total degrees of all terms effectively present in A 
considered as a polynomial in z and z,. We shall call m the multiplicity of the 
singular solution y = 7. It is obvious that if A has a multiple factor, any solu- 
tion of that factor is a singular solution of multiplicity greater than unity. 


7. Singular solutions of multiplicity exceeding unity 

We shall prove that if A has a singular solution of multiplicity exceeding unity, 
then the exponent of [A] is ~. 

(a) Let y = 7 be asingular solution of A of multiplicity at least 2, and suppose 
that y = 7 is an essential manifold of A. Let z= y— 7. The product repre- 
sentation of [A] must have a factor whose manifold is y = 7, that is, a factor 
2, such that {2,} = {z}. Now, it is known that 


(12) 1 = [A, z"), 





6 The definition here of singular solution is broader than that given by Ritt in his book, 
Differential equations from the algebraic standpoint, and used in the Appendix of the present 
paper, for A is not assumed here to be algebraically irreducible. 

7 If P, = 0, we let p, be any fixed non-negative integer. 
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where q is any sufficiently large integer.” Because y = 7 is of multiplicity at 
least 2, it follows that A e[z’, zi]. Hence, by (12), 


a Sle’, zi]. 


Thus, the exponent of 2, is not less than that of [z’, zi], which, by Theorem 5, 
is ©, Therefore, using Theorem 1, we see that the exponent of [A] is o. 

(b) Suppose that y = 7 is a singular solution of A of multiplicity greater 
than unity, which is not an essential manifold of A. Let y — » =z. We shall 
show first that {A} contains a form which, when considered as a form in z, 
has at least one term of the first degree. 

Let C;, --- , Cg be the distinct irreducible factors of A of order unity, and 
let F = C,---C,. If T denotes the separant of F, the derivative of F is a 
form Tz, + U, where U is a form of order not exceeding unity. Hence 


Tz = —U [F]. 
It follows, by suecessive differentiations and eliminations, that 
T%2, = V; [F, t= 2,---,n+ 2, 


where the a; are appropriate integers, and the V; are forms whose order is not 
greater than unity. Letting a be the maximum of the a; , we have 


T2:=W: (Fi, i= 2,-+-,n +2, 


where W; = T* “V;. 
Let J be the initial of F. Then J is of order zero. Clearly, there exists a 
non-negative integer b such that 


JWi,=X;: (FP), i=2,.--,n+2, 


where each X; is of order not exceeding unity, and is of degree less than n in z; .” 
It follows that 


JT =X; Fi, i= 2,---,n+2. 


The n + 1 forms X; are linear combinations, with coefficients which are poly- 
nomials in z, of the n quantities 1, z,,---,21 . Hence, some linear com- 
bination of the X;, with coefficients which are forms of order zero, not all 0, 
must vanish: 


n+2 


> K:X; = 0. 
t=2 


We assume, as we may, that the K; are not all divisible by z. Referring to the 
last congruence, we now see that 
LS 


a and Kolchin, Bulletin of the American Mathematical Society, vol. 45 (1939), 
Pp. 895-898, 


® The degree of F in 2, is at most n. 
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J°T’M = 0 ..[F], 


where M = peas Kz. 

Since not every K; is divisible by z, M has at least one term of degree unity. 
Consider the form y — ¢, where ¢ is either a common solution of TJ and F, 
or a root of a factor of A of order zero, but is distinct from 7. Let N be the 
product of all such forms y — ¢, and let Z = MN. 

Z, like M, has at least one term of degree unity (when considered as a form 
in z). Z vanishes for all solutions of F with the possible exception of y = 7, 
and for all the solutions of A which are not solutions of F’, again with the possible 
exception of y = 7. That is, Z vanishes for all solutions of A, save possibly 
y = 7. Since y = 7 is not an essential manifold, Z must also vanish for y = . 
By the analog to the Hilbert-Netto theorem,” then, Z¢« {A}. Z is the form 
whose existence we were to prove. 

Thus, {A} contains a form z; + H, where H, considered as a form in z, has 
each of its terms either of order less than j or of degree greater than unity. 

Assume, now, that the exponent of [A] is finite, say g. Then” 


(2i4i, + Hi) --- (Zitig + H;,) = @ [A] 


for all non-negative integers 7,,---,7%,. Since y = 7 is of multiplicity at 
least 2, as a singular solution of A, it follows that A ¢ [z’, z{]. Hence 


(24%, + H;,) pe (Zi+é, i H;,) = 0; [2’, zi]. 
If we compare terms here of degree g and weight gj + i: + --- + %,, we obtain 
iti, eee Zitig = 0 [2”, zil, i, eee fe = 0, 1, 2, eee, 


This is easily seen to imply, however, that [z]’ has a strong basis. This contra- 
diction of Theorem 5 completes the proof. 

EXAMPLE 1. Let A = p ipeoe" Q,yi (n = 1), where the Q, are forms of order 
zero with constant coefficients, and where Q, ~ 0. 

If A has no multiple factors of order unity, the singular solutions must be 
constants, for they must satisfy an algebraic equation with constant coeffi- 
cients;” since S = }*?_, »Q,y!", it is evident that the singular solutions of A 
are precisely the common zeros of Q and Q,. A singular solution is of multi- 
plicity exceeding unity if it is a multiple zero of Q). Thus, if A has a multiple 
factor, or if Qo has a multiple factor which is also a factor of Q: , then the exponent 
of [A] is ~. 


ae Singular solutions of multiplicity unity; the product representation 


Throughout §§8-10 we suppose that A has singular solutions, and that they 
are all of multiplicity unity. In the present section we shall show that all the 





2” See the paper cited under footnote 2. 
*1 Hi, is the i,%-th derivative of H. 
22 The equation obtained by eliminating z,; from A = 0, S = 0. 
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singular solutions are essential manifolds, and shall find the product representa- 
tion of [A]. With an additional assumption, we shall determine the exponent 
of [A] (§10). 

Since A has no singular solution of multiplicity exceeding unity, A has no 
multiple factors. Hence A has only a finite number of singular solutions (§6). 

Let y = 7 be a singular solution of A, and write z = y — n. Since S must 
vanish for y = 7, we see from (11) that either P; = 0 or p; > 0. This means, 
since y = 7 is of multiplicity unity, that P) ~ 0 and p = 1. (11) thus becomes 


(13) A = Poz + > P,z2”’ 2. 


It now follows from a theorem of Ritt that y = 7 is an essential manifold of A.” 
We now turn to the product representation of [A]. 
Let y = n:,7 = 1, --- , 8, be the singular solutions of A; we denote y — »; 
by u;. It is evident that we may write 


(14) i ee Te 


where the manifold of 2» consists of the non-singular solutions of A,” and where 
the manifold of 2;,7 > 0, is y = n;, that is, {2;} = {u;},i = 1,---,s. 
Referring to (13) and making use of a known result,” we see that 


(15) 2; = [uil, 


The determination of Zp is more difficult. The first step will be to find {Xo}. 
If z is one of the u;, then (13) holds. Differentiating (13) we obtain 


Paeu My 


Ai = Puz + Do Prnz?’2i + a (P. +> p.P.2?"2,) ~ D> vP,2”" zi ' +22, 
v=1 v=1 v= 
where P,, is the derivative of P,. But by (13), 
(16) Poz = —a >, P,z?*zi = (A). 
y=1 


Hence 


PoA: = Po Poz + Pot: >, Pnz’’zit + Pou (P. 4+ >" pP.2?*2,) 
v=] v=1 


+ (Pree + Poz > Pa? *) Ze 





s Ritt, Annals of Mathematics, vol. 37 (1936), pp. 552-617. See especially §5. Professor 
Ritt’s proof is not applicable to abstract differential fields, as it involves function-theoretic 
concepts. But an abstract proof of the sufficiency part of the theorem (we need only the 
sufficiency) has recently been achieved by H. Levi, and may be expected to appear in the 
literature shortly. 

“If A is algebraically irreducible, the manifold of Zo is the general solution of A. 
* See §2 of the paper cited under footnote 18. 
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= 2 (-Po p> P, 22 + Py a Paz’ zy’ + Po (P. + D> p.P,2?"e) 


+ (-Pe > P,z”*zy* + Po Dy wP.2? a5) a), (A), 


so that 
(17) PA, = a(Uaz+V) (A), 
where 
= —P,z?"" > P,2?’zi' + Po = vP, 2" zi”, 
(18) yon yd 
Vi= —Pu 2) Preet* + Pod, Paz ai + Pa( Po +2 p.P.z2), 


From (17) and (16) we find that 
z(P(Uz + PoV) =0 (A, Ai). 
This implies that 
{A} = {2} N {A, PoUz + PoV}. 
The two ideals in the second member here are separated, for by (18) 
(19) PV=Pi40 _ [dl 


so that Poze, + PoV #0 [cz]. 
Now, z represents any one of the u;. Let 


(20) I; = PU, J; = PoV when z= 4%, ¢=1,---,8 
Then by the above, for 7 = 1,---,8, 
{A} = {us} N {A, Ie + Ji}, 
(21) uIue + Ji) =0, waldue+Ji)=0 (A, Ai), 
and y = 7; is not a solution of Ju + J;. It readily follows that 
(22) {A} = {A, Iw + Ji, ---, Teuea + Je} N {uy} M--- A {us}. 


The s + 1 differential ideals, whose intersection appears in the second member 
of (23), are separated, so that we may replace intersection by product. 
Comparing (22) with 


{A} = {2} N {2 N--- M {2,} 
(which follows from (14)) and with (15), we see that 
(23) {Zo} = {A, Tue + Ji, --+ , Dette + Joh. 
We shall now obtain two forms M and N such that 
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(4) Me{m}-++ {uj}, Ne{2}, MNe[A], M+N 
This will enable us to determine 2p. 
Let 
B; = Iva + Ji, 
and let 
Q: = Po when z= 4, 
Then Q; is not divisible by u;, and, by (19) and (20), 
(25) B;- Qi =TIue (ui, va). 
Since Q; is not divisible by u; , there is a form C; of order zero, and an element 
a; ¢ §, such that 


1= aQi + Cu; ‘sis 
This may be written 
1 = (Cu; — a(B; — Qi)] + aB;. 


By (21), (25) and the definition of B; , moreover, 
(Cu; — a(B; — Qj)]-a:B; = 0 [A], 


provided J; =0 (u;, Ua), that is, provided P, = 0 or p, > 0 when z = u; (we 
refer to (20) and (18), and the fact that p, > 0). If this is not the case, then 


[Cui — a(B; — Qi)]-(aBi)’ =0 [Al], 


because u2B; = 0 is a consequence of uB; = 0, and the latter congruence 
holds by (21). 

Accordingly, we separate the singular solutions of A into two classes. A 
singular solution y = 7 of A will be said to be of the first class if in A, considered 
as aforminz = y — 9, the coefficient of z} is divisible by z; a singular solution 
which is not of the first class will be said to be of the second class. 

We suppose the singular solutions »; ordered so that m,--- , 7, are of the 
first class, and 41, +++, m» are of the second class. This is the same as sup- 
posing that 


(26) P, = or po >O when z= u, 
P2,#0 and p=0O when z= 4, 
We let 
M; = Cu; — a(B; — Qi), 
= (Cu; — a(B; — QP 
4. Cu; = a;(B; ws Qi)]-a:B; ’ v 

= a,B;, 

;= (a,B,)’, 


* It will be observed that a; = 0. 
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Then M; « {us}, Nie {Zo}, M,+ N; = 1, MN; «[A], 7 = 1,.-- ) 8. Letting, 
therefore, 


M=M,.--M, 
N = (M+ ™,)---(M.+N,) — M---M,=1-M, 


we see that (24) holds. 

We refer now to the paper cited under footnote 5. The proof in that paper 
shows that 2) = [A, N]. But, by (28), VN = 0(N;,--- ,N.), and N; = MN;+ 
NN; = 0 [A, N}. Hence Zo on [A, Ni, Pires , N,]. Referring to (27), we see, 
finally, that 


(29) 2) = [A, Bi, ---, B,, Bra, -++ , Bil. 


(28) 


The product representation of A is given by (14), (15) and (29).” 


9. Continuation 
The purpose of this section is to prove that 
(30) ‘ (eR Bee 
By (29) it suffices to show that 
(31) BuiBri =0 (2), of r=1,++-,8 §59=0,1,2,--. 
Let Iz + J represent any B, = I,ti2 + J... We shall prove that 
(32) (Iz + J)(Iza+J);=0 (2) 


for all z, 7. 
From (21) we see that 


(33) zla+J)=0, aUle+J)=0  [Aj. 


Differentiating the second congruence here | + 1 times, we find that 
+1 


(34) VP y . ') Z4i(lze + J)» = 0 [A]. 


A=0 


By (29), (32) holds for i = 0,7 = 0. We assume that (32) holds for 7 = 0, 
1,--- ,l — 1 andj = 0, and prove it fori = 1,7 = 0. 

Multiplying (34) by Iz + J, we obtain, using (33) and the induction 
assumption, 


(+ lje(Ize + J)(Im+J)=0 (2%). 
But Ia(Iz~ + J) = — J(Iz@+ J), (20). Hence 
JImz+J)(Ia+J)=0 (2X). 





7 Actually, 2» may not be connected. This will occur if A has more than one factor of 
order unity. 
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However, by (20) and (19), J is congruent, modulo (z, z:), to a nonzero element of 
3. Hence, by (33), (Ize + J)s(Ize + J) = 0 (2). 

Thus (32) holds for all 7 when 7 = 0. 

To prove (32) for all 7 and all 7 we again use induction. Assume (32) to be 
valid for all¢ and forj7 =1— 1. Differentiating (32) for 7 = 1 — 1 we see that 


(Ize + J)inn (ae + J)ii + Ua + J)iUa + J): = 0 (Xo), 


so that, by the induction assumption, (Iz + J);(Iz + J); = 0 (2). This 
proves (32) for all ¢ and 7. 

Thus (31) holds whenever o = r. To prove (31) in the general case, consider 
B,iB,;. It is sufficient, of course, to suppose that o > r, for B, e X) if ¢ Sr. 
Now, the remainder of B,; with respect to B, is a form in {Zo} of order not ex- 
ceeding unity. Such a form must be divisible by A. Hence, for an appropriate 
integer m, 17 B,; = 0 [A, B.]. Hence, since (31) holds when 7 = o, I7B,;B,; = 
0 (X). But ¢ >+7, so that (see (26), (20) and (18)) J, is congruent modulo 
(Ue , Mei), to a nonzero element of §. Hence, by (21), B.iB,; = 0 (2). 


10. The regular type 
We shall say that A is of the regular type if 


(35) 1=0 (%,S,h,---, 1). 


In the present section we assume that A is of the regular type. 

We shall prove that the exponent of [A] is 1 or 2 according as all of the singular 
solutions of A are of the first class or at least one singular solution of A is of the 
second class. 

(a) Let all the singular solutions of A be of the first class, and let G ¢ {Zo}. 
We shall show that G € 2. : 

For each i we have, for an appropriate integer b; , I/‘G = G;[B,], where G; 
is of order unity, at most. Since G; € {29}, G; must be divisible by A. Hence, 
fori = 1,---, s, °°G = 0 [A, Bj. Similarly, for a suitable c, SG = 0 [A]. 
Now, by (35), there are forms X; , and a form T' ¢ 2 , such that 


1= 7+ XS + Xi! Pe Ds ne 


Therefore, 
G=(T+X8S+XJ+.-.-.-+X02)G=0 ([A,B,---, Bl. 


But all the singular solutions of A are of the first class. Therefore, by (29), 
%=[A,B,,---,B,]. Hence, G ¢ Xp. 
Thus, the exponent of X)is unity. But the exponent of 2; = [ui],t = 1, ---,8, 
is obviously unity. Hence, by Theorem 1, unity is also the exponent of [A]. 
(b) Let A have at least one singular solution of the second class, and let 
Ge {Xo}. Asin (a) we see that G e[A, Bi, --- , B,J. This, together with (31), 
proves that the exponent of 2» does not exceed 2. Hence the exponent of [A] 
8 No greater than 2. 
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To prove that the exponent of [A] is exactly 2, it suffices to exhibit a form in 
{A} which is not in [A]. To this end, we let 2 be any u; with 7 > r, and show 
that {2} contains a form z, + H, where h is a sufficiently large positive integer, 
and H is a form, of order less than h, contained in [z]. 

By (35) there is a relation 


(36) 1=T+XS+Xh+--- + Xd, T €X. 


Let h(= 3) be an integer such that h — 1 exceeds the orders of T, X,, --- , X,. 
If we differentiate each B; , written as a form in z, h — 3 times, we obtain relations 
Izgea + Ki = 0 {20}. Similarly, differentiating A h — 2 times we obtain 
Szr1 + Ko =O {2}. All K; are forms of order less than h—1. These rela- 
tions, together with T'z,1 = 0{ Zo}, yield, because of (36), a relation z,_, + K = 
0 {2 }, where K is a form of order less than h — 1. 

Let @ be the term of K free of z and the derivatives of z. If a = 0, then 
(2x1 + K): is a form z + H as described above; if a ¥ 0, then a(a“(z,_1 + K)); 
is such a form. 

Now let £ be the product of all the u; other than z, and consider E(z, + H), 
where z, + H is the form whose existence we have just proved. Wesee that {A} 
contains a form Ez, + F, where £ is a form of order zero, not divisible by z, 
and F is a form of order less than h. 

We shall now complete the proof by showing that Ez, + F A O0[A]. The 
method will be to assume that Hz, + F = 0 [A], and to force a contradiction. 

Let then 


(37) EFa+F =CQA+CiAi +--+» +CiA:, 
where A; represents the j-th derivative of A. By (17), for alli > 0, 
Pod; = (a(Uze + V))i-an (A, Ai, «++, Ai) 
It follows from (37), therefore, that 
Po(Ez, + F) = DA + Dyzx(Uz + V) 
(38) + D2(2(Uee + V)1 + «(Uz + V)) 


t—1 


+-.--+D>D ( Be Y es(Ua + V)--+ 
i=0 u 
We now consider the various forms in (38) as polynomials in the z;. We shall 
modify (38) in such a way as to make evident a contradiction 
Consider a power series” be Bzi in z, such that A vanishes when 2BAi 
is substituted for z. Such a power series exists, as by (13) 2, = 0, z = O renders A 


0A 29 
zero and leaves gz Ronzero. 
Z 





%8 All power series which we mention are formal. 

* The “implicit function theorem” for abstract fields of characteristic zero may be 
proved by the method of undetermined coefficients. The proof is simpler than the one 10 
analysis, because the series is formal and no questions of convergence arise. 
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Substituting 28,2} for z in (38), we find a relation 


E'y + F' = Diai(U'ze + Wo) + D3(2(U'es + Wi) + za(U'z, + Wo)) 
(39) rea ft-—1 

+---+D, x ( i )esn( U2 + Wi-14). 
Here FE’, F’, U’, the D; and the W; are power series in z; with coefficients which 
are polynomials in the z; with i > 1. Each W, is free of z; with 7 > i + 1. 
Moreover, Z’ and U’ involve only z,, and have nonzero terms of degree zero. 
For E’ this is true because P}E is not divisible by z; for U’ because P, ~ 0 and 
m = 0 (zis a wu; with ¢ > 7; see (26) and (18).) 

Let 


(40) W= 4%, w; = U'2e; + Wis, += 2,3,---. 


Since U’ has a nonzero term of degree zero, the equations (40) may be inverted 
to obtain 


(41) 4 = WwW, a= Yu;+ Z;, t= 2,3,---, 
where Y is a series in w, with a nonzero term of degree zero, and each Z; is a 
series in w; with coefficients which are polynomials in w, , --- , wi. The equa- 


tions (41) define a substitution, with inverse substitution (40). Applying sub- 
stitution (41) to (39), we obtain a relation 


Qu, + R = Lywyw, + Le (wiws + (Yur + Z2)we) 
+ eee + Li(wywiy + (6 — 1)(Yur + Ze)wi + --- + (Yur + Zur), 


where Q is a series in w, with a nonzero term a of degree zero, and R is free of 
the w; with 7 = h. 

For the second member of (42) to produce the term aw, which appears in the 
firs; member, some L; with ¢ = h must have a nonzero term of degree zero, 
because each term in the coefficient of L; is of at least the first degree in 
W,W.,---,w;. Thus, some of the LZ; have a term which is simply an element 
of §, and hence certainly terms bwi(b « §, b = 0,7 = 0). Let L, be the L; of 
highest subscript, with this property. Then p = h. Of all the terms of L, of 
the above mentioned type, let cw} be the one of least degree. Then 


Ly (wywpy1 + +++ + (Yup + Zp)wr) 


contains the term CWi Wir. This term does not appear in the first member of 
(42), and must therefore be cancelled by terms in some expression 


(43) Lj(wywigs +--+ + (Ywx + Z;)we) 


with i * p. But if i < p, then each term of (43) is divisible by some w; with 
2Sj <p. Hence cwi ‘'w,4: must be cancelled by terms from expressions (43) 
withi > p. This implies that some L; with i > p has a term of the type but? 
(be §,b #0). This contradicts the definition of L, , and completes the proof. 


(42) 
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The following example shows that of the forms A with constant coefficients, 
which have singular solutions, all of multiplicity unity, those which are not of 
the regular type are exceptional. It also shows that, for any preassigned pair of 
integers r, s (0 < r S s, s > O), there exist forms of the regular type having r 
singular solutions of the first class and s — r singular solutions of the second class, 

EXAMPLE 2. Let 1, s, m, n be integers with 0 Sr Ss S m,s>0,n 23, 
letu; = y—ni,t=1,---, 8, wherem, --- , 4. are distinct constant elements 
of a differential field € of characteristic zero, and let 


m—s m—s 


= Lowy, A= Lou, e= L aay, 


Q = L awy’, vy = 3, -++,N, 
= 


where doo , 1, -** , Gmn is a set of constants, algebraically independent with 
respect to €. Consider the form 


A = Qou +++ ue + Qrur +++ wets + atm +++ wey + 2 Qui 


with coefficients in § = (a, dor, «++ , @mn)- 

We shall show that the singular solutions of A are m , «++ , 7, Where m, +++, % 
are of the first class, and 7,41, --- , 7, are of the second. We shall prove, more- 
over, that A is of the regular type. In fact, we shall prove the stronger result 
that 


(44) 1=0 (A, S8,hi, +++, Le, Ji, +++, de). 


Let R be the resultant with respect to y, of A and S. R is a polynomial in y 
and the a,,. If we let the a,, take on special values, such that A becomes 
uy +++ Us + y}, then S becomes ny}, so that R becomes n"(wm --+ Us)" ¥ 0. 
Since £ is not 0 for special values of the a,, , R does not vanish for unknown dy . 

Now, each singular solution is a solution of R, which is of order zero and has 
constant coefficients. Since R ~ 0, this implies that all the singular solutions of 
A are constants. Hence, from the expression for A and the expression for S 
derived therefrom, we see that a singular solution of A must annul Qow --- % 
and Qiu --- us, that is, must be one of the 7;. That each 7; is a singular solu- 
tion is obvious. Thus, the singular solutions of A are m,---, 7. [tis appar- 
ent that these are of multiplicity unity, and that 7; is of the first or second class 
according asi S rori>r. 

We find, using (20) and (18) that 


qT; => Qo(u cee Uj-Ui41 °° Us) I, J; => (Qo eee Up Uiqi -*° us) J, (A), 
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where 


[= — hus ++ te = QQaes ++ wen — QD Qui 
+ 2QoQ2u +--+ ur + Qo > vQ, yi, 


J = (Qots +++ ts)’ + (Qiu +++ ts)’yr + (Qour +++ up)’yt + > Qyi, 


accents indicating differentiation with respect to y. Hence, 
(A, S, I, ig gaa, Ji, ae » Je) ag (A, S, Qol, QJ). 


Suppose, now, that (44) is false. Then there is a pair of values y = 9, y: = 92 
which annul the polynomials A, S, QJ, QJ. When y, = 0, A, S and Q3/ 
become, respectively, Qow: --+ te, Qiu --- Us and Qi(Qou --- u,)’, and these 
polynomials have no solution in common. Hence g; ~ 0. Also, the resultant. 
with respect to y; of S and 


Se — Qo +++ Us 
Y1 





Qit +++ Us + Qos +++ UY + > Qyr7 


is a nonzero polynomial in y, relatively prime to Q .” Hence @ is not a root of 
Q = 0, so that g, 7, annul J and J. 

Let M be the resultant with respect to yi , N the resultant with respect to y, 
of J and 


nA — yS = nQous +++ Us + (n — 1I)Qim--- Uy 


n—1 


+ (mn — 2)Qow +++ uryi + > (n — »)Q,yi- 


M is a polynomial in y, N is a polynomial in y; , each with coefficients which 
are polynomials in the a,, not including don. Moreover, M and N are not 0." 
Now, j, 7: annul M and N. Thus, g and 4 are algebraic functions of the a,, 
other than ap,.. Since A vanishes for y = J, y1 = 91, and since 9 = 0, it follows 
a a, is an algebraic function of the other a,,. This contradiction completes 
the proof. 





* This resultant is nonzero because it is nonzero when A = y? + u --: Usy1; itis prime 
to Qo because its coefficients are independent of aoo, --* , @m—0 - 

“If we specialize some of the dy, so that A = yt + yt! + Qoui---u., then 
J= (Qou; --- us)’, nA = ys = NQou; --+ Us + yi, and M = (Qnui; --- us)’ #0, so that 
M does not vanish before we specialize. If we specialize A further by setting y: = 0, then 
J = (Qour +++ us)’, mA — yrS = nQour -++ u,, and N is the resultant of nQou: --- us and 
(Qou: --- u,)!, which is not 0. Thus, N is not 0 before the specialization. 
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Exampte 3. Letn = 2, and let A = Qyi + P, where P and Q are relatively 
prime nonzero forms of order zero with constant coefficients, P being of positive 
degree and without multiple factors. The singular solutions of A are the roots 
of P = 0, and are all of multiplicity unity. 

If z = y — n, where 7 is a root of P = 0, then m = 0, and we may write 
A = (P/z)z + Qet. Thus, referring to (18), we see that 


U = n(P/z)Qzer”, 
V = —(P/z)'Qat + (P/z)Q’at + (P/z)’. 
Hence, if m, --- , 7. are the roots of P = 0, then (see (20)) 


PM: 
I; =n ) yi 
ni 


y 
P P MoS P). Agia Ps ¥ 
ee ( (F.)or +? ow) +(7 J. 


Since the roots of P = 0 are distinct, it follows that Qy?* = 0(1,, --- , I,), 
so that P= 0(A,lh,---,J,). Since Pand Q are relatively prime, then, y~ = 
O(A, Ii, --- , Is), so that 


P 3 

( ) =0 (A,h,---, Ie, J1,-++, ds), ¢=1,---.8, 
Yn 

From this it follows that 1=0 (A,h,---,JI,, J1, +++ , Js), 80 that A is of the 

regular type. 

The coefficient in A of y; is divisible by y — 7; if and only if n > 2 (in which 
case the coefficient vanishes). Thus, the singular solutions of A are all of the 
first class or are all of the second class according as n > 2 or nm = 2. Conse- 
quently, the exponent of Qyj + P is 1 or 2 according as n > 2 orn = %. 

Exampte 4. Let A = xy} — 2yy: + y, where z is any element whose deriva- 
tive is 1. Then S = 2(ry, — y), so that the singular solutions of A are y = 0 
and y = x. We may write A = z(y, — 1)? — 2(y — z)(y: — 1) — (y — 2). 
Both singular solutions are of multiplicity unity, and are of the second class. 

If we let m = 0, m = 2, we have, referring to (20) and (18), hh = 
2(ry: — 2y + x), I, = —2(xy, — 2y). Hence 1 = (J, + J2)/2z, so that A 
is of the regular type, and the exponent of [A] is 2. 

The following example shows that not every form with singular solutions, all 
of multiplicity unity, is of the regular type. 

ExamPLe 5. Let A = yj + yj + P, where P is a form of order zero, with 
constant coefficients, and with distinct zeros m, --- , 7». ‘The singular solutions 
of A are m, --- , 7, all of multiplicity unity and of the first class. S = 4yi + 
3yi. By (18) and (20), 


ee 
i= ( ) Aye + 
y "i ( Yi 341), 
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n=? (-(4,) nat+ 0 +(,)) 
e eles) - (; ea) bf =, ctl 


It follows by (29) that 2» = [A, B] = (A, [B]), where B = (4y? + 3y:)y2 + P’. 
Hence 


(,S, hi, aie , I.) _ (yi + yi + P, 4yi + 3y:, B, Bi, Be, wee ) 
= (256P’ — 27P, 9y, + 64P, B, B:, Br, «++ ) 
= (256P — 27, P’, 9y: + 64P, B,, Be, --- ). 


Thus A is of the regular type if 256P — 27 and P’ are relatively prime. How- 
ever, if 256P — 27 and P’ have a factor in common which is relatively prime to 
P”, then A is not of the regular type. This is easy to see from the congruences 


By = (4yi + 3y:)ys + By + 3)y2 + Py: 
= —3y3 — 3P” (256P — 27, 9y: + 64P), 
By = (4yi + 3y:)ye + 3(8y + 3)yays + ++ 
= —9yy; + He (256P — 27, 9y; + 64P), 
By = —3(k + 1)yyenr + Mi (256P — 27, 9y: + 64P), 
where H; is of order not exceeding k. 


11. Case in which no singular solution exists 


In this section we suppose that A has no singular solutions. By the analog 
of the Hilbert-Netto theorem this means that 


1=0 (S, Si, wa , Sp, A, Ai, pget: , Ag) 


for sufficiently large integers p and gq. 
We shall consider only those forms A for which p may be taken as 0, that is, 
for which there is a g such that 


(45) 1 =0 (8, A, Ai,---, Ag) 


We prove that if (45) holds, than the exponent of [A] is unity. 

Indeed, let Ge {A}. It is required to show that Ge[A]. Performing a 
partial reduction of G with respect to A, we see that there is a positive integer 
j such that S’G = G@’[A], where G@’ is of order less than or equal to unity. 
G’e{A}, and hence is divisible by A. Therefore S’G = 0 [A]. Of course, 
AG = 0 [A],i = 0,1,---,¢. By (41), however, 1 = 0 (S’,A, Aj, +++, Ag). 
Hence, G = 1.4 = 0 [A], qed. 

The following example shows that a form A, which has no singular solution, 
for which (45) fails to hold, is exceptional. 














764 ELLIS ROBERT KOLCHIN 


Examp_e 6. Let § be the differential field obtained by adjoining the un- 
knowns a,, (u = 0,--- ,m;v = 0,--- , n) to some differential field of charac- 
teristic zero. We consider in §{y} the form 


A= yo Zz, Aw Yy Yi. 
p=0 v=0 

We shall show that A has no singular solutions, and that (45) holds, with q = 1. 

A; is linear in yz. Let A; = Sy2+ 7. Wemust prove that 1 = 0(A, S, 7). 
Suppose 1 40(A, S, 7). Then there is a solution y = 9, y: = 9 of the poly- 
nomials A, S, T. Now, g and g, must annul, respectively, the resultant with 
respect to y, and the resultant with respect to y, of Sand T. These resultants 
are nonzero polynomials in y and y;, respectively, with coefficients which are 
polynomials in the a,, and a, , not including ao. It follows, since y = 7, 
Yi = 1: is a solution of A, that a is an algebraic function of the other a,, and 
the a,:. This contradiction completes the proof. 

There exist forms A for which (45) is satisfied for some g > 1, but for which 
(45) is not satisfied if g = 1. This is shown by 

ExampLe 7. LetA =1+2y,+ (1—y*)yi. Then S = 2+ 2(1 — yn, 
and A has no singular solutions. Now, yi + 1 = A — (y:/2)S, so that 


(S, A) = (y’, ym. + 1). 
Also, Ar: = —3y’yi + 2(1 + (1 — y*)y)y2 = 3y°(S, A). Hence 
(S, A, Ai) = iy’, yi + 1). 
Again, A: = —6yyi — 15y"yiye + 2(1 — y*)ye 


+ 2(1 - (1 sa y*)ys)Ys = —6y + 2y2 (S, A, Aj), 
so that 


(S, A, A; ’ Ae) ai (y’, Y1 + 1, y2 4% 3y). 


Finally, As = —6yi — 54yyiye — 36y’yy? — 2ly’yiys + 6(1 — y*)yoys + 
2(1 + (1 — y*)ys)ys = 6(1 + 9yye — yays), (S, A, Ar, Az), 80 that 1 = yo(ys — 9), 
(S, A, Aj, A2, A3), and 1 = y2(Yys 533 9y)* = 9y’ (ys ta 9y)* =0 (S, A, Ai, Aa, As). 
Thus, (45) holds for g = 3, but for no lower value of q. 


12. The intermediate ideals for hyperelliptic forms 


We have seen in §10 that if A has singular solutions, all of multiplicity unity, 
at least one of which is of the second class, and if A is of the regular type, then 
[A] is properly included in {A}, and has exponent 2. It is natural now to ask 
what differential ideals there are between [A] and {A}, that is, what differential 
ideals A there are such that [A] C A © {A}. We shall call such a A an inter- 
mediate ideal. In the present section we answer this question completely for 
the forms of Example 3 with n = 2. 

We are dealing then with a form A = Qyj + P, where P and Q have constant 
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coefficients, are of order zero, and are relatively prime. P is assumed to be of 
positive degree, relatively prime to P’ = =P. Such a form A we shall call a 
hyperelliptic form. 

Our first task will be to find a suitable basis for 2p . 


Let m,--:, m» be the singular solutions of A, that is, the roots of P = 0. 
We have seen that 


(46) 2 = [A, (Tye + Ji)’, p ate (Leys + J.)'),” 

where 

(47) y—-a)Zy+J)=0, wlye+Ji) =0 ~ [A], 
and 

(48) 1=0(4,h,---, Is). 


Congruence (48) shows that there is a form K, with constani coefficients and 
of order less than 2, such that 


(49) yo + K =0(A, liye + Ji, --+ , Ley2 + Js). 
We shall show that 

(50) 2 = [A, (Ys + K)’I, {Zo} = [A, Y2 + K], 
and that 


(51) (y—m)+--Y— m)(y2e+K)=0, wlyet+K)=0  [Al. 

Now, (51) is obvious from (47) and (49). As to (50), it is evident from (46), 
(49) and the fact that 2» is of exponent 2, that [A, (yz + K)*] © 2». Toprove 
the inclusion in the opposite direction, consider any Iiy2 + J;. We have 

Tye + Ji = Iilye + K) + Ji — IK. 
Hence J; — I;K € {Xo}. Since the order of J; — I:K does not exceed unity, 
this form is divisible by A. Thus, 
I: y2 + Ji = 0 (A, ye + K), 
so that 
(Iy2 + Ji)’ = 0[A, (ye + K)'I, i= 1,---,8. 


This, in conjunction with (46), shows that 2» © [A, (ys + K)*], and completes 
the proof of the first part of (50). The second part of (50) is now obvious. 
The congruence (50) gives the basis sought for 2). We have, as a result, 





as a P has constant coefficients the »; are constants, so that (y — 21); = Yi, 
‘= 9°, 8 J wid o--, 
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(52) [A] = [P]-[A, (2 + K)'l, {A} = [PI]-[A, ye + Ki, 


for P is a constant times the product of the y — 1.” 

If we refer to (51), the first intermediate ideals which present themselves are 
of the type 
(53) [P]-[A, (ye + K)’, y — m4) +++ Y — 14,)(y2 + KI, 
where 7;, --- , t, are distinct integers from 1 to s. If g = s, the ideal (53) is 
[A], if g = 0, (53) is {A}. 

We shall show that the ideals (53) are distinct from one another, and that every 
intermediate ideal is one of the ideals (53), that is, the ideals (53) are the only 
intermediate ideals. 

To prove that the ideals (53) are all distinct, it suffices to show, h = 1, --. , 8 
that 


[P]-[A, (ye + Ky’, (y ore Ni;) es (y aa} Nin) (Y2 + K)] 
# [P]-[A, (ye + K)’, (y — ni) ++» Y — mH) + KD) 


? 


For, if 


[P]-[A, (yz + K)’, (y — ni) «++ Y — mH) Y2 + KD] 
and 


[P]-[A, (ye + K)*, (y — 4) «++ Y — 1%) Y2 + KD] 
were equal, they would both equal 
[P]-[A, (ys + K)’, (y — m) +++ (Y — m)(y2 + KDI, 
where k, , --- , k; are the integers common to 7, --- , % and ji, --+ , jy ; andif 
[P]-[A, (ye + K)’, (y — m4) +++ (Y — nye) (Y2 + KD] 
= [P]-[A, (ye + K)’, (y — nu) +++ Y — M4) (2 + KD) 
with d > 1, then we would have 
[P]-[A, (ye + K)’, y — m,) +++ Y — mG + K) 
= [P]-[A, (ye + K)’, (y — 24) +++ (Y — mye + KI 
For h = s this means we must show that 
[A] c [P]-[A, (ye + K)’, (y — mi) +++ Y — mr) + KD 
for all distinct 7;,---, 7,1. This will be proved, however, if we show that 
(54) (y — m1) +» Y — Mya + Ki) #0 [AI.” 


for the first member here is in 





33 See (10). 
* K; is the j-th derivative of K. 
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[P]-[A, (ys + K)’, (y — ni) «++ (Y — i,-.)(y2 + KDI.” 
The relation (54) may be established in the same way in which (37) was 


disproved. 
From (54) it easily follows that 


(y¥— m) ++ Y— m%-)Y2+K) AO (2%). 
For h < s we proceed by induction. Assume that 
(y — mi) + Y — MG —-DY — Mijas) -+- Y — Tans (Ye + K) 
0 (A, 2 + KY, Y — m4) ++ &Y — naarde + KD] 


for all distinct 7, --- , tay, andallj = 1,---,h +1. (Forh = s — 1 this has 
just been proved.) 
Suppose, say, that 


(y— m) «++ (y¥ — ma)(y2 + K) 
= 0 IA, (v2 + K)’, (y — m)--- y — m)(y2 + K)). 
Then, for ¢ sufficiently large, 
(y — m) +++ (y — ma)(y — m)‘(y2 + K) 
= 0 [A, (ye + K)’, (y — m) +++ Y — muy + K)), 
Combining this congruence with 


(y — m) +++ (Y — my)(y2 + K) 
= 0 [A, (yve+ Ky)’, (y — m) -+- Y — mu)(ye + K)I, 








(55) 


















we see that 









(y— m) +++ (Y — may — m4) (ye + K) 


=0 [A, (+ K)’, (y — m)--: Y — mu) + K)I. 
This contradicts (55), and completes the first part of the proof. 
As for the second part, let = be a differential ideal between 2» and {Xo}, 


distinct from {2}. We shall complete the proof by showing that = coincides 
with some ideal 


(56) [A, (ye + K)’, (y — m4) --- (Y — 1,)(y2 + K)I. 



















* The first member of (54) is in [P] because it is in [y:] and y: « [P]; the first member of 
(54) isin (A, (y* + K)*, (y — mir) «++ (Y— niy—1)(y2 + K)] because ((y — nix) +++ (y — née) 
e+ K)) = (y= nes) ++ Y= niena))'v(¥e + K) + Y= 9) “+ Y— 1a) + KD) = 
(Y — nix) +++ (y — migng)(ys + Ki), (Zo). Moreover, the ideals [P] and [A, (y2 + K)?, 
y — 71) +++ (y — nig-1)(y2 + K)] are separated, because P and y: + K have no solution 
in common. 
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Let q be the smallest integer for which some ideal (56) with g = q is included 
in >. q is an integer from 1 to s. Suppose, say, that 


sz 
where 

A = [A, (ye + K)’, (y — m) «++ (y — ned(ye + K)l, 
but that 
[A, (ye + K), (y — m) +--+ (y — ni-adly — tins) --- (YY — ng)(y2 + K)] ¢ Zz, 
4 = 1,---,q. We shall prove that 2 = A. 


Assume to the contrary that Fe =, F ¢A. Since Fe {2} = [A, yo + Kj, 
we may write 


(57) F = QA + Cil(ye + K) es Ci(yers + Ki). 


Let D, be the remainder of C; with respect to y, + K. Since (yis: + Kis) 
(Yr + Ki) =0 (A), t= 1, 2, -¢* 5 have, by (57), 


(58) F=Ciye+ K) +--+ Culye + Kee) + DY + Kin), — (A). 


D, is a polynomial in y, y;. Let EZ; be the sum of the terms of D; free of y: . 
Since, by (51), (yi(ye + K))iu = 0 (A), it follows that 


(59) mi(yen + Kin) = —(t — Dycl(ye + Kee) — --- — yelye + K), (A). 
From this and (58) we see that 
(60) F=CiQye+K) +--+» + Cin(ye + Kee) + Bilyei t+ Kix), (A), 
where the C; are new forms. 

Let H, be the remainder obtained on dividing E,; by (y — m) --+ (y — %)- 
Since ((y — m) +++ (y — m)(ye + K))in = 0 (A), 
(y — m) +++ (YY — ne) (Yer + Kir) 

= —( — 1)(y — m) «++ (Y — 14))1(Ye + Kea) 
— += (Y— m) +++ (Y— a: + K), (A) 

Hence, it follows from (60) that : 


(61) F=Ci(y+K)+--»-+ Chily + Kes) + Ayu + Kea), (A) 


where the C;’ are new forms. 
Treating the first ¢ — 1 terms in the second member of (61) as we did the 
second member of (57), we obtain, after ¢ — 1 more steps, a relation 


(62) F = Aily2+ K)+--- + Ayu + Kix), (A), 


where each H; is a form of order zero and degree less than q. 
At least one H; must be nonzero, else F eA. We may (and do) suppose that 
H,; # 0. 
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From the congruence y:(y2 + K) = 0 (A), we obtain relations 
(63) ! vii + Kin) =0 (A), ga tet, 
Therefore, if we multiply (62) by yi we find 

Aiyi Youit+ Kin) =0 = (2). 
Now, by (59) and (63), 


yi (yin + Kia) = —(9 — Uyt yey; + Kis) 
= (j— ly "Kyi + Kis), (A), 


ded 








so that 





yi (ya + Kea) = (¢— 1)IK'"(y2 + K),o (A). 





Hence, 
H.K’"(y2+K)=0 (2). 
If L is the sum of the terms of K free of y:, we find, since y:(yz + K) = 0 (2) 
H.L'"(y2+K)=0 (2). 
But (y — m) --+ (y— ng)(y2 + aa =0 (2). Hence, if J is the greatest common 
divisor of H.L** and (y — m) --- (y — 1g), 
J(y2 + K) = 0 (2). 
But L is prime to (y — m) --- (y — ng), for otherwise the ideals [y — ml, - 
ly — nl, [A, ye + K] would not be separated. Hence J is a divisor of H.. ty re 
of degree less than g, and is a proper divisor of (y — m) --- (y — 1). This ba 
contradicts the definition of g as the least g for which = includes an ideal (56), i 
and completes the proof. 
The results just established show that the ideals between [A] and {A} can be 
organized into ascending chains, all of the same length s. That is, if Ai, ---, 
Ay. are intermediate ideals such that 


[AJC AC +» C Any Cc {A} 
(chain of length h), then there are s — h other intermediate ideals which to- 
gether with [A], Ai, --- , Asa, {A} form a chain of length s. Moreover, there 


exist no chains of length exceeding s. This illustrates a phenomenon which 
we shall discuss in Part III. 















) 






Part III. CHarns 


13. Generalities 


We return now to an abstract differential domain of integrity with basis 
theorem, which contains the rational numbers. 
The n + 1 differential ideals oo , 01; «++ , n-1, o* are said to form a chain if 


(64) o Co Ceres Conn Co. 






le 
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n is called the length of the chain. We shall call o and o* the end 
ideals, 01, +++ , On. the intermediate ideals, of the chain. The ideals o;_,, g; 
(also, on-1, «*) will be said to be contiguous. A set of ideals of the chain will be 
called non-contiguous if no two ideals of the set are contiguous. A chain is 
called a principal chain if no pair of contiguous ideals have a differential ideal 
properly between them. A principal chain may be expressed schematically by 
means of dashes: 


0 — 01 — +++ — Onan — of. 


The notation ¢ — 7 means that o C 7, but there is now witha Cw Cr. A 
chain with end ideals oo , o* is said to be a refinement of the chain (64) if each 
intermediate ideal of (64) is an intermediate ideal of the chain in question. 

Let oo, 01; +++, On-1, o* be a principal chain, and let there be a second 
principal chain oo , 01, -++ , Gi-1, Ti, Citr, -** » On-1, O* With o; ¥ 7;, that is, 
schematically, let 

0; 
(65) yor 1 6K Doig — 2 — ona — 0 
} T? 
We shall say that. the first chain is flexible at the intermediate ideal o; , and that 
the second chain is obtained from the first by a flex. 

The first theorem on chains that we shall state is that of Jordan-Hélder- 
Schreier. A differential domain of integrity D forms a group under addition. 
As a set of operators on this group we may consider 1. multiplication by an 
element of D, 2. differentiation. The permissible invariant subgroups of this 
group with operators are precisely the differential ideals of D. Thus, we may 
state 

THrorEM 7 (Jordan-Hélder-Schreier): Two chains with the same end ideals 
have refinements of equal length, with differential rings of remainder classes which 
are isomorphic (in some order). 

Proor: See van der Waerden, Moderne Algebra, vol. 1, 2nd edition, §46, or 
H. Zassenhauss, Lehrbuch der Gruppentheorie, vol. 1, chapter 2, §5. 

As a consequence of Theorem 7, we see that all principal chains with the same 
end ideals have the same length. 

THEorEM 8 (Ore): T'wo different principal chains with the same end ideals, 
can be obtained from each other by a finite number of successive flexes. 

Proor: See O. Ore, On the foundations of ahstract algebra. 1, Annals of Mathe- 
matics, vol. 36 (1935), pp. 406-434. 


14. The exponent of a principal chain 


By the exponent of the chain (64) we shall mean I,so). We shall be concerned 
with principal chains, exclusively. Of course, if o* & {oo}, then the chain (64) 
has exponent ©, so we may limit ourselves to chains for which o* © {0}: 





% By the differential ring of remainder classes of a chain (64) is meant the differential 
rings 01/00, 02/01, --- , o*/on-1. 
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Our chief result may be stated as follows. 
TurorEM 9: Let ’ 


(66) Go — 01 — +8* — Ona —o*, o* & {ao}, 
be a principal chain which is flexible at r non-contiguous intermediate ideals.” 
Then the exponent of the chain (66) does not exceed n — r + 1. 
As a consequence of Theorem 9, we see that if a differential ideal o has finite 
exponent p, then there is a chain, with end ideals o and {co}, of length p — 1. 
Proor: We begin by showing that, if 7 — 0; , and o, © {oo}, then 1,,09 = 2. 
Indeed, let 1,00 > 2. Then there is a b € 0; such that b’ ¢00. Hence 


o Clo, bo] Ga. 


Since o) — o1, we see that o; = [oo, b’]. In particular, b « {oo , b’), so that 
b = A (op), 


where A is a form in b with coefficients in D, which has no term of degree less 
than 2. Using the iterative process introduced in the paper cited under foot- 
note 19, we see that b = 0 (oo). This contradiction shows that 1,,09 = 2. 

We now show that the exponent of the principal chain (66) does not exceed 
n+1. 

For chains of length 1 we have just given the proof. Let > 1, and assume 
the result for chains of length less than n. 

By the assumption, o*” © o,. Suppose o*”* € o. Then 


a & (am, o*"") Gan. 
Since o) — 01, we see that o,; = (0, c*""’). Hence 
ott ie oto*” = o*o; = o* (ao , o*”*) 
S (a, o*"™) & (00, o**o1) 
<S (o, (oo, o*"**)) = (a, * eee. 
S++» S (0, o*") S (a0, oi) So. 
This contradiction proves our statement. 

We are now ready to complete the proof of our theorem. What we have just 
shown is that the theorem is valid for r = 0. We proceed by induction. Let 
r > 0 and suppose the theorem holds for fewer than r non-contiguous ideals. 

Let o; be the intermediate ideal of (66) of lowest subscript, at which the 


- is flexible. Then we have, with o; ¥ 7;, the situation depicted by (65). 
ow 


Oi — Citi — °° — Gn-1 — o* 








* Clearly, r is an integer between 0 and the greatest integer in n/2, inclusive. 
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the chain is flexible at two intermediate ideals. Yet its exponent is 3. 
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and 
Fé. 7 Ciqg 292 "See o* 


are both principal chains of length n — 4, flexible at r — 1 intermediate ideals, 
Hence, by the induction assumption, o*”*"" € o;, o”*"? Cz,. Bu 
on Gor Co; and oi, — o;, so that oy = oN 7;. Therefore, 

4n—t—r+2 
oO od G1 ° 

Suppose, now, that o*”"" € o. Let ox be the o; of least subscript, such 
that ot” *"*"" Co;. Thenl S$ k Si — 1 and 

ot Co: grr ¢ 3) gant a aS 

Hence 


—k—r+2 
Tk-1 S (on—-1 ’ o*” * ) c Ok, 


. —k—r+2 
so that, since ox1 — ox, oc = (o%-1, o*” ). Thus, 
—k—r+2 —k—r+1 
ee ae. knee, 


at o® (ona c genrrars) 


ius S (on-15.0°", ) = oma, 
This contradiction completes the proof of Theorem 9. 


Example 8. Consider the chain 


y"™, yil, ly”, yi), et" s [y’, yi), ly] 


in any differential ring §{y}. It is easy to prove that this is a principal chain, 
flexible at none of its intermediate ideals. The length is n’and the exponent 
is obviously n + 1. Thus, the bound on the exponent given by Theorem 9 is 
the exponent itself. 
The necessity of the non-contiguity condition in Theorem 9 is shown by 
ExamMPLe 9. Let § be a differential field of characteristic zero, which con- 
tains an element x with derivative 1. Consider the chain 


ly’, YY» yl, ly’, yi ’ yl, ly’, vil, [y] 
in §{y}. This is a principal chain of length 3. Since 


[y*, yi, yol 
[y’, yy, ni We y:] 
y’, wl 


and 


ly’, w] 
Ow te 


[y’, yi, yol [yl, 
Nv, Yi — : 14 
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AppENDIX. GENERAL SOLUTIONS AND RESOLVENTS 


Liquidating an obligation incurred in Part I, we give here an abstract, purely 
algebraic treatment of the subject matter of Chapter 2 of Ritt’s book, Dif- 
ferential equations from the algebraic standpoint.* We shall make free reference 
to this book, designating it by (#), and shall give proofs only when they differ 


from those in (R). 


1*. The general solution of a form 


We work in an abstract differential field § of characteristic zero. The letters 
u;, yi and w will denote unknowns. 

We establish the following fundamental lemma as a substitute for the exist- 
ence theorem for differential equations. 

Lemma 1*: Let A be an algebraically irreducible form in §{y,--- , yn} of 
positive class. Let B,, +--+ , B, be nonzero forms in Bim , +++, Yn}, reduced 
with respect to A. Then B, --» B,¢ {A}. 

Remark: Lemma 1*, ‘Seieiilliae with the analog to the Hilbert-Netto theorem, 
shows that every algebraically irreducible form of positive class has a regular 
solution, that is, a solution for which the separant S and initial J of A do not 
vanish. For otherwise JS would be in {A}. 

Proor: Let m be the class of A, r the order of A in y,. Suppose 
B,--. B,e {A}, that is, (B, --- B,)” [A], for some positive integer p. Now, 
A and (B, --- B,)” are relatively prime. Hence there is a nonzero form C, 
either of class less than m, or of class m and of order less than r in y» , such that 


C = DA + E(B, --- B,)’, 


where D and E are forms in §{yi1, --- , yn}. C €[A], so that we may write, for 
suitable q and Q;, 


(1*) C = QA + QiAi + --- + QoAg. 


Now, A; = Sym,r4i + T;, where 7; is a form of order less than r + 7 in ym. 
Hence, if in (1*) we consider the forms as polynomials in the letters y;; , and let 


Yarn = ——=, i=1,2,---, 


we see that S"C is divisible by A for a sufficiently large integer h. Since neither 
S nor C is so divisible, we have a contradiction, completing the proof. 

We now are in a position to follow (R). 

Let A € §{y1, +++, yn} be algebraically irreducible and of class n. Let S 
and J be the separant and initial, respectively, of A. Let 2, be the set of all 
forms in §{y,---, Yn} which vigels for all vepeiar w solutions of A, or what is 
the same thing, let 2, = {A}:IS. 





* American Mathematical Society Colloquium Publications, vol. 14. 


\ 
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Following (R), we can show that 2; is a prime differential ideal. The manifold Di 
of 2; is called the general solution of A. ‘ 
The following three results are proved as in (R): 
2, = {A}:8; 
2, is an essential prime differential ideal in the decomposition of {A}; the other Ti 
essential ideals contain S; i: 
_ 21 is independent of the order of the unknowns. 


2*. The resolvent of a prime differential ideal 











Let > be a perfect differential ideal in F{yi1,--- , yn}. Let Sc 
(2*) iy eaoo aD 
be a basic set of 2. We denote the separant and initial of A;,7 = 1,--.,r, 
by S; and I;, respectively. Let 2’ = {Ai,---,A,}:J:S; --- I,S,, that is, let H 
>’ be the set of all forms in ${y:, --+ , yn} which vanish for all regular solutions W 
of the basic set (2*).” Clearly, = & >’. tk 
Following (R), we see that if 2 is prime, then 2 = ZX’. In particular (2*) has 
regular solutions. Also, 2 is the only prime differential ideal with basic set (2*). um 
Let € be the set of all constants (that is, elements with vanishing derivative) w 
in §. € itself is a differential field of characteristic zero. 
We shall need the following in 
Lemma 2*: m,---, mn €% are linearly dependent over © if and only if 0, 
f 
m1 os Nn 
(3*) m1 26°" Dnt tn, 


Min—1 °** Dn,n—-1 


Proor: If >+?_1¢,9, = 0, where the c, are in € and not all 0, then }>7-1 cm: = 0, 
i = 0,1,---,m — 1, so that (3*) must hold. Conversely, let (3*) hold. If I 
n = 1, the lemma is obviously valid. Let n > 1 and suppose the lemma 
holds for fewer than n elements n,. Let 













gp oe Le Bf Toc cn cs cas tchbgetgenceneaeen 
Nin—2 °** Yo-1n—2 Yrtin—2 *** Nn, n—2 





If un = 0, the lemma for n — 1 shows that the », are linearly dependent over 
©. Assume that u, #0. By (3*), 


(4*) > Nrilty = 0, 





t 









® A solution of (2*) is called regular if it does not annul 7,S, --- I,S; . 
“ For i = n — 1, the first member of (4*) is the development of the determinant in (3*) 
by the minors of the elements of the last row. For i < n — 1, (4*) is a statement of the 
vanishing of certain determinants with two identical rows. 
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Differentiating the first n — 1 equations (4*), we find 





2, stir + Le min = 0, 7=0,1,---,n—2. 





Taking into account the last n — 1 equations (4*), we see that })>1 mimn = 0, 
i=0,1,---,%— 2. These equations may be written 
n—1 


D mit = —nnitter , 7=0,1,---,n—2. 
v=1 





Pap RN See Sag 






Solving these equations in wn, +++ , #n—1,1 by Cramer’s rule, we obtain 







Mnt 
ot = — Me, v=1,+--,n—-l1, 
Un 






Hence (u»/Hn)1 = (Unb — Mnit»)/Mn = 0, so that u,/un = ¢,,v = 1,---,n, 
where the c, are constants, not all zero (c, = 1). Using (4*) with = 0, we see 
that }°?-1 ¢-n, = 0, which completes the proof. 

Lemma 3*: Let § contain a non-constant element §. Let A be a nonzero form 
in %ly:,--- Yn}. Then there are elements m,---,m«€& such that A # 0 
when yi = ™,°*+*5Yn = M- 

Proor: It suffices to consider the case n = 1, for the cases n > 1 follow by 
induction. Let A %{y} be of order p, and let Y; = > 20 ci(é');, 7 = 
0, 1,---,p, where the c; are new unknowns.” ‘The Jacobian of the p + 1 
functions Y; of the p + 1 variables ¢; is 


Bits 2 as Fi # 
0& @h--- (€")1 |. E 


0 fp ()» it (é")p 


If w were 0, € would be algebraic over € (by Lemma 2*) and would therefore 
be a constant, so that w ~ 0. Hence Yo, --- , Y,, considered as polynomials 
in ¢,-++,¢p, are algebraically independent.” ‘Therefore, if we substitute 
Yo, Yi1,---,¥,in A for y, y:, «++ , Yp, respectively, A will go over into a 
nonzero polynomial C in the c;. Choose integers yo, 71, --- ,Yp Such that 
C # Owhenc; = ¥;,i = 0,1,---,p. Then the element 7 = > Povit' € F 
leaves A nonzero when substituted for y. 

Using Lemma 3*, we can follow the proof of §25 in (R) to obtain the following 
result. 

Let = be a non-trivial prime differential ideal in F{u, +--+, Ug, Yr» +++ 9 Yr) 
with arbitrary unknowns uy, +++ , Ug (q may be 0.) If § contains a non-constant, 
then there are elements 4, +++ , up € §, and a nonzero form Ge F{u, --+ , Ue}, 
such that if 










































“ (#); is the j-th derivative of ¢. 
“See O. Perron, Algebra, vol. i, Berlin, 1932, p. 134. 
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= ai , / aia ma a” wu" 
(5*) U1, °** »Uq,Yr,°°* »Yp and U1, +++ Ug, Yi, *** Yn 


are two distinct solutions of 2, then either G(%,, --+ , %,) = 0, or 
mlyt — yt) + +++ + uolyd — >) ¥ 0. 


Taking over the proof in §26 of (R), we obtain: 

Let § be any differential field of characteristic zero. Let = be a nontrivial prime 
differential ideal in §{m,---,Uq, Yr,°++,Yp} with arbitrary unknowns 
U1,+++,Uq. We suppose thatq > 0. Then there are forms 


G,M,,---,Mpe Flu, --+ , Up}, 


with G # 0, such that if (5*) are two distinct solutions of  , then either 
G(t, --+ , %) = Dor 


Myth, +++ , tq) (yi — Yt) +--+ + Molt, «++ , Ge) (yp — y>) #0. 


Henceforth we shall deal with a non-trivial prime differential ideal > in 
Miu, +--+ ,Ue, Yi,°*+,Yp}, With arbitrary unknowns ™,--- ,u,, and we 
shall assume that either § contains non-constant elements, or the arbi- 
trary unknowns wu; really exist. Then there exists a triad of forms G, P, 
QeFlu,--+,Ue, Yr, °°+, Yn}, with G free of the y;, and not 0, and with 
P ¢Z, such that if (5*) are two distinct solutions of = for which GP # 0, then 


Wn, +02, Ug, Yi, +685 Yo) xz Ua, +e+, tg, Wis “+5 Yo) 
a * 7 7 *s o: 7 ih * 
Plt, vee, Ug, Yr, +++) Yp) P(t, see, Ug, Yi; #5 Yp) 








Following (R), we form the ideal 
Q = {Z, Pw — Q}:P Cc o{u, co Ug, W,Y1,°°° »Yp}- 
Q is prime, with arbitrary unknowns wm, ---,u,. Further, 


T= ON Flay, «++, Ue, Yr, °°" 5 Yo- 
Let 


(6*) A, Ai,+-+,Ap 


be a basic set for Q, introducing w, y;, --- , yp in succession. We take, as we 
may, A algebraically irreducible. 

We shall show that each A; is of order zero in y; and is linear in y;. The 
equation A = 0 is called a resolvent of >. 

Suppose that not every A; is as described. Let A; be the first A; which either 
is of positive order or is not linear in y; , and suppose that the order of A; in 9; 
is r, with r positive. 

Let U be the remainder of ISJ,S, -.- I,S,GP with respect to (6*), where the 
I’s and J’s are the initials and separants of the forms of the basic set (6*). 
Then U has the properties: 1. The order of U in w is not greater than that of A; 
2. The order of U in Yi, ‘= 1, eos st is not greater than that of A; 3. U is free 
of yi+1, Raid » Uv; 4. U ¢Q. 
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Of all the forms in (U, Q) which have the properties 1-4 listed for U, let V 
be one of least rank in y;. 

We shall prove that the order of V in y; is less than r. Suppose it equals r. 
Then the degree of V in y;, is no higher than that of A;. Let J be the initial 


of V. Then J ¢Q. 
For an appropriate integer m, we have 


J"A; = EV+F, 


where E is a nonzero form of lower degree than A; in y;,, and where F has the 
properties 1-3 and is of lower degree than V in y;,. Hence F eQ, so that 
EV «Q, whence E ¢€Q. 

Lett E = Hy + Avy + --- + Hyyit, where H, ¥ 0, and each H; is free of 
yiut,°** » Yp and is of order less than rin y;. The initial of J” A; is identical 
with that of EHV. Hence H; ¢€Q. 

It is easy to see that there exist positive integers a, a, --- , a; such that 


I'l... lf7H; = Hi (9), i=0,1,---,t, 


where each H; is reduced with respect to the ascending set A, Ai, --- , Aj-1. 
Since H, ¢2, we have H; ¥ 0. Thus, Ho + Aiyi + --- + Hy}, is a nonzero 
form in 2, reduced with respect to the basic set (6*). This contradiction shows 
that the order of V in y; is less than r. 

Now let 


(7*) tl, +++, Ug, W,hi,-++, Up 

be a solution of 2 for which V (and consequently JSJ,S; --- I,S,GP) does not 
vanish. For %, +--+ ,%g,@, Hi, ---,9j-1, the forms A, --- , A; all vanish, 
A; becomes a nonzero form A; in y; of order r, A;, i = j + 1, --- , p, becomes 


aform A; of order zero and linear in y; , and V becomes a form JV in y; of order 
less than r. It follows, by Lemma 1*, that (y; — 9;)V does not hold A;. 

Let 9; be a solution of A; for which (y; — 9;)V does not vanish. When 
¥i=9;,Ai,t7 =j7 +1, --- , p, becomes a linear form of order zero in y; alone. 
The new forms A;, i = j + 1,-+-,p, may be solved to obtain a solution 
Jin, rae = 

Then %, -++ , %y, DB, Hi, +--+, Gi-1, Hj, -+- » Jp is a solution of 2 for which 
GP does not vanish, is distinct from (7*), but has the same w = @ and the same 
ui = @,¢% = 1,---,g. This contradicts the fundamental property of the 
triad G, P, Q, and shows that A; is of order zero in y;. 

ai A; is of degree unity in y; may be proved asin (R). This completes the 
proof. 

- rest of Chapter 2 in (R), except for §33, may now be taken over, word for 
word, 


CoLuMBIA UNIvVERsITY, 
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HANKEL AND OTHER EXTENSIONS OF DIRICHLET’S SERIES 


By R. E. GREENwoopD 
(Received December 14, 1939) 


1. Introduction 
Series of the type 


(1.1) f(s) = yo io tiXe. 


where (a) s is the complex variable, s = o + it, o and ¢ real; 

(b) the numbers a, are all complex constants; 

(c) Ar < Ae < As Sees CAn < eee 

lim A, = © 
are known as Dirichlet series, and their theory has been considered by numerous 
investigators. Widder’ has generalized these series by the consideration of a 
Stieltjes integral. 
If we consider series of the type 


(1.2) o(8) = Da, exp (—Aas)G0as) 


it may be possible to establish convergence properties, subject to certain restric- 
tions on the function G(z), fcr the series (1.2) similar to the well known con- 
vergence properties of the Dirichlet series (1.1). 

In particular, this method of generalizing Dirichlet series was suggested from 
a consideration of series of the type 


(1.3) h(s) = D0 an(idns)* HS (irn8), 
n=1 
where H(z) is the Hankel function of the first kind of order y and is related 


to the more familiar Bessel functions J,(z) and J_,(z) by the relation 


J_,(z) — exp (—vri)J,(z) 
2 sin vr : 





(1.4) H?(2) = 


The function H$” (z) has the integral representation’ for R(v) = —4 





1D. V. Widder, “A Generalization of Dirichlet Series,’ Transactions of the American 
Mathematical Society, Vol. 31, p. 694f. (1929). 
*G. N. Watson, ‘‘The Theory of Bessel Functions,” page 168. 
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(15) Hi°@) = (2) ia i (- a 4 i) [ exp (—ww9(1 ~ sy: du. 





m2 re +4) 
If we replace z by dz and define 
wo w\4 
(16) Mate) = [exp (—wa(1 + ey au 
(2) exp |-«(7 i *)| 

(1.7) A, = * ean. 

I'(v + 3) a 
we see that ie : 
(1.8) (iz) H® (iz) = A, exp (—z)M,/(2). i i 


Therefore the series (1.3) can also be written as 
(1.9) h(s) = >> a, A, exp (—An8)M,(An8) 
n=1 


and thus is formally identifiable as a series of type (1.2). 

For convenience we shall call series of type (1.2) modified exponential series, 
and the particular series of type (1.9) Hankel series. Moreover we shall suppose 
that we are given a fixed A, sequence satisfying 


(a) ea gg Ue eos oe Ve ae 
(b) lim A, = ©. 


The symbol K(a) will be used to indicate a positive constant, depending per- 
haps on a. 


2. Lemmas Concerning the Function G(z) 


In order to establish convergence properties for the modified exponential 
series (1.2) we shall assume that the function G(z) satisfies the following prop- 
erties (which properties are understood to be assumed whenever G(z) is men- 
tioned hereafter) : 

Property 1. G(z) is analytic for R(z) > 0. 

Property 2. G(z) is bounded for R(z) = z% > 0, i.e. say 


|G(z)| << K(m) for R(z) 2am>0. 
Property 3. G(z) has an asymptotic expansion for large z of the form 


(2.1) Ge) xb tt By ..., 
ae 
where by ¥ 0. 
On the basis of these properties we can establish a number of lemmas, which 


we shall state for the function G(xs) where z is a real variable. 
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Lemma 1. There exists a constant r such that for 
(a) z2r> 90, 
(b) R(s) 2 7 > 0, 
we have 
| G(xs) | < Ki(r)’. 


Proor: This is merely a restatement of property 2. 
Lemma 2. Given + > 0, there exists an integer No(r) such that for 


(a) rZ=r, =%n2N)(z), 
(b) R(s) 2 7 > 0, 
we have 
0 < K2(r) S | Gas) |. 
Proor: Using the asymptotic expansion 


a bi , be 
(2.1) G(zs) b+ + aat--, 


and noting that bo ¥ 0, we can get 


|G(xs) — bo| < = for large x 


where B is a constant, and hence we can clearly select x values and a constant 
K2(7) so that we obtain 


0 < K2(r) < | G(as) |. 
Lemma 3. Given d, > 0, there exists a real number £(:) such that for 
(a) r2r > 0, 
(b) R(s) 2 §> 0, 
we have 
0 < K;(r) < | G(as) |. 


Proof is similar to that of lemma 2. 
Lemma 4. For 


(a) R(s) = Rls 
(b) ee 





3 In reality K; depends also on d; , but we are considering the \», sequence as fixed and 
do not indicate this dependence. 





















int 


and 
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| G(zs) 
G(x80) 


as follows at once from combining lemmas | and 2. 


= K,(r); 


Now the asymptotic expansion for dG(z)/dz can be obtained from the asymp- 
totic expansion, 


bi , be 
ze} oe 
by termwise differentiation,‘ thereby leading to 
dG(zs) | bi _ br 


(2.1) G(xs) = bo + + ++, 








(22) oe ee 

OG(as) br Dn 
(2.3) sag ee ORs a 

2 > 
(2.4) CO og 4 Sy... 


ézds ~ as? 253 : 


and from these expansions we obtain 
Lemma 5. For R(s) 2 7 > O there exists an integer N,(r) such that for z = >, , 
n = N,(r) we have 


0G (zs) 
0s 


dG(zxs) 
Ox 


a’ G(zs) 
0x08 


Lemma 6. For R(s) = r > O there exists an integer No(r) such that for q 2 
p = N2(r) we have 


E 
Proor. This follows immediately from the asymptotic expansion (2.3). 
The inequality 


< K;(r), 





< K,(r), 











< K,(r). 








oe) ae < Kir) E - x | < sue 





ba +h)|—-1f@|| < fe +h —f@| 
h = , 


| h| 








‘Bochner, ‘Fourier Analysis,’ (planographed notes, Princeton University), pages 7 
— also Knopp, ‘‘Theorie und Anwendung der Unendlichen Reihen,’’ third edition, 
p. ; 
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shows that 


asl) < | af) 
= dz 


and thus leads to 
Lemma 7. For R(s) = r > O and forg = p = N2(r) we have 


(Sea 


Lemma 8. For &(s) = R(s0) = +t > Oand for x = rz, n = max [N;,(r), No(7)] 
we have 


OG (xs) dG (zs) 
Ox a an 
G (x80) G(as) 


Proor. Writing the function as the integral of its own derivative, we have 
dG(xz) AG(az) | 
7 





< K,(r)|s — %|. 


OG (x80) dG(as) a G(2xz) Clee) = 
~~ . jm yie. / *  0zdx Ox dz 
G(xs0) G(zs) [G(az)/? 


‘4 K;(r)Ki(r) + K;(1)Ke(r) 
[Ke(7)? 




















ae < Ki(z)|s — a] 


by applying lemmas 1, 2, and 5. 
In particular, the function M,(z) defined by 


0 4 
(1.6) M,(z) = I exp (—wa4(1 os z) du, 


satisfies Properties 1, 2, and 3. For M,(z) is ane both in z and in », and has 
the asymptotic expansion’ 


~ Se Te+ Pro t+ 4+) 
M,(2) & x mil +} — m)(22)™ 


aro+p(r+ 5 Wate — Shi e Ge D j 


m=1 23™m\z™ 








and thus, since b) ~ 0, properties 1 and 3 are satisfied. Property 2 can be 
established by taking an upper bound to the complex valued integral repre- 
sentation (1.6) for R(z) = z > 0. 

Thus any convergence proof for the modified exponential series 


(1.2) g(s) = s Gn EXP (—Ans)G(An8) 





5 Watson, l.c., p. 198, and Bochner, l.c., page 15. Care must be taken in interpreting 
the notations in these references for the case m = 0. 
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which uses only properties 1, 2, and 3 of the function G(z) shows at the same 
time that the corresponding Hankel series 


(1.9) h(s) = > a, A, exp (—Ans)M,(An8) 


satisfies the same type of convergence. 
For convenience we state a familiar lemma. 
Lemma 9. Abel’s lemma on partial summation. We have identically 


¥ v.08 ee | (Ve — Vass) 2s Un | + Ve 3 Ue 


n=p n=p u=p 


3. Cuseeeiin of the Modified Exponential Series 


TuroreM 1. If the modified exponential series (1.2) ts convergent for so = 
a) + il) where oo > O, then it is convergent for any s satisfying R(s) = R(so.) = ao. 

Proor: The proof of this theorem is contained in the more general theorem 
which follows. 

TurorEM 2. If the modified exponential series (1.2) is convergent for 8 = 
5) + ito where oo > O, then it is uniformly convergent throughout the angular sector 
about the point s» in the s plane defined by the inequality 


jam (s—s)| Sy <5: 
Proor: We need only show that for s in the angular sector, given 6 > 0, 
we can find an integer mp such that for g 2 p = ™ we have 


n=q 


>> an exp (—Ans)G(Ans) | < 6. 


n=p 
In Abel’s lemma on partial summation we identify 
U,. = Ga exp (—Yn80)G(An So), 


G(An8) 


V, = exp [—(s — 80)An] =~ Gas)’ 


so that we have 


n=q n=q 


p> a, exp (—An8)G(An8) 


n=p 


G(Anq1 8) 


wee Gn) > xp [—(0 — a) Anil Gn4180) 


< 2 exp [=(s — a)Aal G5 


2 Gm EXP (—)m 80)G(Am 80) 


+ exp {—(e — adel G22] 5 ag exp (—An)G0me) | 


m=p 
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By hypothesis, given an e > 0, there exists an integer m such that for q2 
p = m we have 


n=q 


>» a, exp (—An80)G(An So) <. 

n=p 
We take m = max [No(r), Ni(7), m], where No is given by lemma 4 and M1 by 
lemma 5. Then (3.1) becomes f 


n=q 


Dd: an exp (—Ans)G(Ans) 


n=p 








Fia. 1 


Se . exp [—(s — )An] Go — exp [—(s — aren] Gores 


+ «|exp[—(s — ade] Gee , 


Writing the function as the integral of its own derivative, we have for n 2 % 


exp [—(s — 59)An] Wee — exp [—(s — 80)Anqal Coe) 
G(zs) 


G (x80) > | 


< (s—a) [™ exp [—(s — s)z] 
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, me dG(xs) dG(zs) 
ny G(s Ox ox 
+) [0 else - 92 GS later — Gen 


An+ 


< K,(ao)-|s — w-| 


exp [—(s — s)] de 


Ant+1 
+ Kilo) Ks(e)-|8 — %|- [ exp [—(e - s)2] dr| 


by application of lemmas 4 and 8, and hence 


S K,(ou)(1 + Koon) Ween [exp [—9(# — #)An] — exp [—9t(s — &0)Antl. 


Substituting this inequality in (3.2) we get 








Se R(S-S,) 
Fie. 2 


n=q 


> Gn EXP (—Ans)G(An8) 


< So eh Kool + Ko(oo)] [exp [—R(s — 80)Ap] — exp [—R(s — 80)Ag]] 
+ €Ki(oo) exp [—R(s — 80)Ag] 
Feat Kaloo + Ko(os)] exp [9866 — Aa 
|s — 8 | 
Ss €K 1o(a0) R(s — so) . 
Now for am (s — %) = 6,|@|<¥ < 5) we get 


|s — so| 


Bieiieht 1 ee ee: 
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and thus (3.4) becomes 


DX an exp (—Ans)G(Ans) | S eKrw(o0) sec y, 
n=p 
which holds for all s in the angular sector and for g = p = m, and thus the 
required condition is satisfied if we specify that we choose ¢ = 8/(Kyo(a0) sec y) 
and the proof is complete. 
DerFIniTIon: We suppose that the modified exponential series is not iden- 

tically zero. Then, as is the case with Dirichlet series, the series (1.2) may: 

(a) Converge for all s, R(s) > 0 

(b) Converge for no s, R(s) > 0 

(c) Converge for some values of s, R(s) > 0. 
In the latter case there is a real number a such that the series is convergent for 
all s satisfying 9(s) > a, and divergent or oscillatory for all s satisfying 0 < 
R(s) < a. The number a is called the abscissa of convergence of the modified 
exponential series, and the line defined by ®(s) = a is called the line of con- 
vergence. The symbol a will be reserved for the abscissa of convergence of the 
modified exponential series (1.2). The remarks of Hardy and Riesz’ relative 
to the convergence of a Dirichlet series on the line of convergence are applicable 
also to modified exponential series. In section 7 of this paper we consider a 
special case where the line of convergence for the Hankel series (1.9) contains at 
least one singular point. 

We shall now prove two theorems which will show equi-convergence of the 

Dirichlet series and the modified exponential series. 


THEOREM 3. If the Dirichlet series > Gn exp (—Ans) converges for the point 
n=1 


8 = a0 + ito , oo > 0, then the modified exponential series >, dn Xp (—n80)G(An8) 
n=1 
ts also convergent. 
Proor: In Abel’s lemma we identify 


Un = Gn exp (—AnSo), 
Va = G(An80), 
and thus we get 


n=q 


Do Gn EXP (—Nn 80)G (An 80) 


n=p 


n=q—l 


(3.5) =f Wa — G0e419)[ 3 am exp (00) 


n=p 


+ G(Aq80) e Gm EXP (—hn80)- 





* Hardy and Riesz, ‘The General Theory of Dirichlet’s Series,”’ page 9. 
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Now by hypothesis, given e > 0, there exists an integer m such that for g 2 
p = ™% we have 


n=q 


7 a, exp (—An So) <«. 

n=p 
Now choose m; = max [mp , N2(oo)] where Ne is given by lemma 6. We get for 
q 2p =n 


"S q—l 


S [60.48) — GOesi8)}| 2 an exp (—R) | 


n=p 


(3.6) Pa © an exp (—rae)| on™ ae 


n=p |m=p 





at [ *¢ | AG (xs) dx < eK (oo) < eK 3(00) 
d Ox mage 


= p T0Ap oom 


on application of lemma 6. 
On reducing (3.5) by use of (3.6) and lemma 1, we get, forg = p = m 


& axexp (—madGOns) | = Kelod 


n=p 


and thus the convergence of the modified i series is established. 


+ Ki(oo)e S eK(o0), 


TuroreM 4. If the modified exponential series >> a, exp (—)n8)G(An8) con- 
n=1 


verges for the point s = oo + tty , oo > O, then the Dirichlet series >) an exp (—AnSo) 


n=1 
is also convergent. 
Proor. In Abel’s lemma we identify . 


U,, = Ad, exp (—AnS0)G(An 80), 


1 


Lae GOn8)' 


Now, by hypothesis, given ¢ > 0, there exists an integer m such that for gq 2 
Pp 2 m we have 


n=q 


> a, exp (—A)G0n8)| me 


in=p 


Now choose m; = max [No(o), N2(oo), m0] where No and Ne are given by lemmas 
2 and 6 respectively. Similar to the proof of the previous theorem, we get, 
for q 2 Pp = Ny 


pe EE _ aac | | a exp (— 42) | 
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1 
n=q—l1 | m=n An+1 0 (-at5) 
DL | LD am exp (—Ym0)G(Am 80) f _\_G(x80)]) 5 


n=p |m=p Ox 
dG (x50) 
=| Semeee 
. Ox eKs(o0) 
se [| las = Sale 
“Shy |(GGrso)P |” = oodpl Kaloo)? 
by application of lemma 2 and lemma 6. 
Abel’s lemma then takes the form 


n=q 


Do an exp (—Ans0) 


© Lemay ~ atone |Z oer warns 


1 > 
+ | gong || 2 om exP (Ans) 


m=p 





eK s(a0) € 
ee FoAp| Ko(o0)}? K2(o 0) 


for g = p = m, and thus the convergence of the Dirichlet series is established. 


4. Absolute Convergence of the Modified Exponential Series 


THEOREM 5. If the modified exponential series (1.2) is absolutely convergent for 
8 = 01, + tt, where o > O, then it is absolutely convergent in the half plane 
R(s) 2. 
Proor. We need only show that for s in the half plane #(s) = oi, given 
6 > 0, we can find an integer m such that for g = p = m we have 
n=q 


Dd |an| exp (—Anc) |G(Ans)| < 6. 


n=p 


But by hypothesis, given « > 0, there exists an integer m such that for q 2 
p = nm we have 





S eK (ao) 


n=q 


LX |an| exp (—Anor) |GAnsi) | <e 

n=p 
We take m = max [n; , No(o:)] where No is given by lemma 2. Then for g 2 
p = % we get 
n=q 


dX | an| exp (—Anc) | G@Ans) | 


n=p 


> G(An8) 
2X | an| exp (—Avas) | Ona) || GG a 


exp [(o1 — o)al 


eK, (01) 


by use of the above and of lemma 4, and thus the proof is completed. 








ed. 


for 


ne 


en 


IV 


IV 















HANKEL AND OTHER EXTENSIONS OF DIRICHLET’S SERIES 789 


Similar to the case of convergence, there may exist a half plane of absolute 
convergence for the modified exponential series, and @ will be used to denote the 
abscissa of absolute convergence and the equation R(s) = 8 will be used to denote 
the line of absolute convergence. 

The following two theorems will show equi-absolute convergence of the 
Dirichlet series and the modified exponential series. 


TuroreM 6. If the Dirichlet series p> a, exp (—A,S:) ts absolutely convergent 


and (s:) = o1 > O, then the modified exponential series Zz Gn EXP (—An81)G (Ans) 
n=1 


is also absolutely convergent. 
Proor. The proof is essentially a repetition of the argument of theorem 3 


except that lemma 7 must be used instead of lemma 6. 

TueorEM 7. If the modified exponential series > ad, exp (—Ans)G(Ans) is 
absolutely convergent for s = 8, = o; + tt , where o; > 0, then the Dirichlet series 
y a, exp (—An81) ts also absolutely convergent. 


n=] 


Proor. The proof follows that of theorem 4 except that lemma 7 must be 
used instead of lemma 6. 


5. Uniform Convergence of the Modified Exponential Series 


TuEorEM 8. If the modified exponential series g(s) = >. dn exp (—An8)G(Ans) 
n=1 


is absolutely convergent for s = s,, R(s:) = B > O, then the series is uniformly 
convergent for R(s) = B. 

Pre »¥. The method of proof for theorem 5 establishes also the above 
theorem. 

However, it may happen that the series is uniformly convergent for a certain 
range of values of s for which it is not absolutely convergent. Indeed, the series 
will in general have a half plane of uniform convergence, and we use 7 to denote 
the abscissa of uniform convergence. It follows at once that the three abscissae 
of convergence satisfy the inequalities a S y < 8. 

Furthermore, the Dirichlet series and the modified exponential series are 
equi-uniformly convergent as is shown by the following two theorems. 


Turorem 9. If the Dirichlet series f(s) = > G, exp (—Ans) ts uniformly 
convergent for R(s) = y > O, then the modified exponential series g(s) = 
> @, Exp (—Ans)G(Ans) is also uniformly convergent for R(s) = y > 0. 

Proor. In Abel’s lemma we identify 
Un = Gn exp (—Ans) 

V, = GQ0,s8). 
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Now, by hypothesis, given « > 0, there exists an integer such that for {t(s) > y 
and for g = p = m™ we have 


m=4q 


Dd Gm XP (—Ams) | < €. 


m=p 





Thus for g = p = ™ and for R(s) 2 y we get 








& a exp (—D8)40us)| S|" G08) — GOnss8)) Dy dn OxD (Ds 
+ |G(Xe8) || 2 an exp (—Ras 








lA 


("2 | Gans) — G@(Angis) | + | GAs) | 


Now choose n; = max [m , Ne(v)] where N¢2 is given by lemma 6. Then by use 
of lemmas 1 and 6, for g = p = m and for all s satisfying R(s) = vy we get 


[is aes de + | GQ,s) ] 


~n=q—-1 


ie 


n=q 


- Gn Exp (—Ans)G(Ans) 


n=p 


























te dG(xs) 
sf ("| saat 
 Ks(y) Ks(y) 
“ EO) + Kay] EE +Kit) | 
< «K(y) 


thereby completing the proof. 
TueoreEM 10. If the modified exponential series g(s) = > an exp (—An8)G (Ans) 


n=l 


is uniformly convergent for R(s) = y > 0, then the Dirichlet series f(s) = 


> Gn exp (—Ans) ts uniformly convergent for R(s) 2 y > 0. 


n=1 


Proor. In Abel’s lemma we identify 
Un = an exp (—Ans)G(An8), 


1 
Va= G(An8)’ 


Now by hypothesis, given « > 0, there exists an integer mo such that for all s 
satisfying R(s) = y > 0 and for qg = p = m we have 


n=q 


> an exp (—Ans)G(Ans)| <€. 


n=p 


7 
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Therefore for g = p 2 ™ and for all s satisfying R(s) = y > 0 we get 



































ints n=q—1 1 1 m=n 
Lo exp (—An8) » = (aes iin aca) py Am EXP (—dAm8)G(Am 8) | 
+ ats = Gm EXP (—Am8)G(Am8) 
ae tall a | 1 1 
a. | p> GOns8)  GOnus)! * |@QQ8) | 





Now choose m; = max [no , No(y), Ne(v)] where No and Nz are given by lemmas 
2and6. Then, by the use of these lemmas, for g 2 p = m1, and for all s satis- 


fying R(s) 2 y we get 








vt ne +1 “dx dz 1 
2, a exp (—Ana)| S p> I [Gear |t laos 

















K;(y) 1 Ks(y) 1 
slime t Kal * ‘Laaiktor + Kaas 
| < «K(y), 
thereby completing the proof. 


6. Consequences of Convergence 

As a result of theorem 2 a number of properties of the modified exponential 
series are readily deduced. We state a few of these. 

THEorEM 11. If D is any finite region in the s plane such that for all points 
of D we he c = a +6 > a > O, then the modified exponential series (1.2) is 
uniformly convergent throughout D, and its sum, the function g(s), represents an 
analytic function in D. 

Proor. This is an immediate consequence of theorem 2 and Weierstrass’s 
classical theorem about uniformly convergent series of analytic functions.’ 

Corottary. The derivative of g(s) may be computed by termwise differentiation. 

TaeoreM 12. If the modified exponential series is convergent for the point 
8 = 09 + ito (where oo > 0) to the value g(s), then g(s) — g(8) whenever s — 8 
along any path which lies in the angular sector | am (s — %) | S ¥ < 2/2. 

Proor. This is an immediate consequence of theorem 2. 


TuEoreM 13. If the modified exponential series g(s) = p> Gn EXP (—An8)G(An8) 
18 convergent for 8 = 8 = oo + ity where a > 0, and if E denotes the angular sector 


jam (s — »)| S¥ <5 


Sa ee) 


‘Titchmarsh, “The Theory of Functions,” page 95. 
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and if g(s) = 0 for an infinity of values of s lying in E, then a, = 0 for all values T 

of n. W 
Proor. We assume g(s) # 0, else the proof is trivial. Since g(s) is analytic 

in E, in any finite neighborhood of a point of H there must be only a finite 


Cl 


t=) 











| 
1 
| 
| 
| 
! 
| 
| 
| 
| f 
| 

| 

! 

I 

| 

| 





Fie. 3 


number of zeros of g(s). Because of the angular character of the set E it must 
be possible to select a sequence of values s, = on + it, where 

(a) on > &(Ax), — given by lemma 3 

(b) limo, = © 


and such that g(s,) = 0 for each n. 
We consider the function 


_ &Xp (Ais) (Ais) 
r(s) = “G0us) g(s) 


G(An8) 


=a+ 8 an exp [—(n — a8] GQ GQus)" 
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This new function r(s) is convergent for s = s and is uniformly convergent in E’ 
where E’ denotes the deleted (if (1) > oo) angular sector 


o = max [£(A), oo] > 0 


TT 


jam (s — %)|S¥ <5 


We let s > © along any set of values whatsoever in HE’. Since for a uniformly 

convergent series we can take the termwise limit,* we get r(s) > a,. But if 

we let s > © over the set {s,} we get r(s) > 0. Thus a, = 0. Since this 

argument may be repeated for each n, the theorem follows at once.’ 
TueoreM 14. If the two series 


vs) = ¥ on (2.00008 


a) = Fs be exp (—n8)G (dns) 


are convergent at s = 8 = oo + tt where oo > 0 and if in an angular sector E 
there are an infinity of values s, such that Y(s8,) = ¢(8n) then dn = b, for all n. 

Proor. On writing g(s) = ¥(s) — ¢(s), we see that the previous theorem 
applies, and thus we get our conclusion. 

As a consequence of the sets of equi-convergence theorems, we have the 
following results:”° 

THEeorEM 15. If the abscissa of convergence a of the modified exponential series 


g(s) = . a, exp (—a8)G (es) 


is positive, then it is given by the formula 











__ log as 
a = lim ome 


TuroreM 16. If the abscissa of absolute convergence B of the modified expo- 
nential series is positive, then it is given by 


__ log 2 | am | 
0 i 


a ot 


* Titchmarsh, 1.c., page 8. 


* This proof is essentially borrowed from Hardy and Riesz, |.c., page 6. 
” Hardy and Riesz, l.c., pages 7-9. 
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THEOREM 17. We have 


8 —~ oie ee 


no An 





7. Convergence on the Line of Convergence for the Hankel Series 


For Dirichlet series it is well known that under certain special conditions the 
real point of the line of convergence is a singularity of the function.” The 
following analogous theorem holds for the Hankel series (1.9). 

THEOREM 18. If vis real, v = 3, and if all the coefficients of the Hankel series 
are positive or zero, and if R(s) = a > 0 defines the line of convergence, then the 
point s = aisa singular point of the function represented by the series. 

Proor. Let the series be represented by 


D anlidns)*H (ras) 


n=l 


h(s) 


= > a, A, exp (—Ans)M,(An 8). 
It will be convenient to use the function K,(z) defined by” 
7.1) K.(e) = exp ("2") He), 
This function has the integral representation” 
(7.2) K,(z) = [ , exp (—z cosh #) cosh vt dé. 
Thus we consider the function 


(7.3) h(s) = (i) Ed exp (-2) » n(An8)? K,(An8) 


which is analytic at the point s = a + 1 = a, and hence can be expanded in a 
Taylor’s series about this point. We shall assume that the point s = a is not 
a singular point of the function h(s), then the above Taylor’s series expansion 
must be valid for a value of s smaller than a, say a2 < a. By justifying certain 
summation interchanges we shall show that this requires h(s) to be convergent 
at the point s = a: < a, thereby establishing a contradiction. 

The domain in which we are interested does not contain the origin, so that 
the function 


(74) ris) = st exp (=) 4 





1 Hardy and Riesz, |.c., page 10. 
12 Watson, l.c., page 78. 
18 Watson, l.c., page 181. 
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will have the same singularities as h(s). The Taylor’s series expansion of the 
function r(s) about the point s = a will, by hypothesis, converge at the point 
¢= a. Indeed 

h(s) 
r= at (5) 


: = > @n(An)* ‘ exp (—A,8 cosh #) cosh vt dt 
. and 
€ oy oe (s — . r™ (a) 


m=0 


~ > See “a ¥ a0)" 


m=0 


j fi exp (—a), cosh t)(cosh ¢)” cosh vt dt. 
0 


We note that the value of the integral is positive since the integrand is always 
positive, that (—1)”"(az — a1)” is positive, and we recall that we assumed the 
coefficients a, were all positive. Hence we can interchange the order of summa- 
tion“ and we have 


r(az) = >> aa(An)? >, Seger I exp (—a1A, cosh £)(A, cosh ¢)” cosh vt dt. 
n=1 m=0 3 
We recognize that 


> oe a an [ exp (—a1A, cosh t)(A» cosh t)” cosh vi dt 


m=0 


is the Taylor’s series expansion for [ exp (—aeA, cosh ¢) cosh vt dt about the 


point a . 
Thus we have that r(az) converges, 


ee OnV/ An | exp (—azA, cosh ¢) cosh vt dt 
n=1 0 


| r(a2) 


ease > OnV Xn K,(n ae). 


Therefore the series 


Mad a ~~ exp (<3 or a ane anr/ thas H (idns) 


= > dn A, exp (—Ans)M,(An8) 





‘Bromwich, ‘Infinite Series,” page 78. 
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converges, and since a, < a we have a contradiction, a being defined as the 
abscissa of convergence. Hence the value s = a must be a singular point for 
the function h(s). 


8. Inversion Formulae 


For the Dirichlet series f(s) = }> an exp (—)x8) Perron has given the follow- 
n=1 
ing formula for the sum of the coefficients 


1 c+ico f( s) 


(8.1) oe Aap ig exp (ws) ds = 2am; 


where 
Nn < (é3] < Anti , 


c>a. 


In order to establish related formulae for the modified exponential series, we shall 
first prove a theorem concerning the order of the function g(s) as t > ~. 
THEOREM 19. If the series 


g(¢) = 2 dn exp [—On — R)sIG On), 


where s is the complex variable s = o + it and where 0 S h S Xz, 28 convergent 
or finitely oscillating at the point s = oo + 0, then given « > 0, 6 > O we can find 
a value to such that for |t| = ti, ¢ = oo + «, and for all n we have 


(8.2) ; : Am exp [—(Am — h)s]G(Ams) | < 4, 
or é 
(8.3) > a. ep 1&0, “Bedeoley eee 


In particular, forn = ©, we have 


(8.4) g(s) = o(| t}) 
which holds uniformly for ¢ = oo + «. 

Proor. Because of the convergence or finite oscillation of the series at s = % 
and because of the fact that there are only a finite number of terms of type (a) 
below, there exists a constant L such that for all integers m, p, and q 2 P, and 
for r S max [No(a0), Ni(o0)] where No and N, are given by lemmas 2 and 5 we 
obtain 


(a) |a, exp [—(A, — h)oo] | < L 





_ F(x) 
® The notation F(x) = o(¢(x)) means that as z approaches a definite limit, lim o(z) = 0. 
























he 
or 


\~ 


HANKEL AND OTHER EXTENSIONS OF DIRICHLET’S SERIES 797 
(b) |@m exp [—(Am — h)ool]G(Amoo) | < L 


> Am exp [—(Am — h)oo]G(Amoo) | < L. 


m=p 


(c) 


Let N be a function of | ¢|, say N(|t¢|). We consider sums of the form 





3 Gm exp [—(Am — h)s]G(ms) 


and treat two cases, depending on whether N < nor N = n. 
Case 1. Let 


(8.5) 1 S max [Ny , Ni] < N <n. 


We write 
N-1 


> dm exP [=m — H)8}G0m8) = Do + a 


Apply Abel’s lemma to the summation - to get 
N 


Y tn exp [—(m — h)s]GQm8) 











= ¥ on exp [=n ~ HIG0ns) + | expl—(6 ~ 0)(q — ] S22 
* He _ pyy GAm418) 
(8.6) ricci, Mayoral G0m+1 =| 
{= ay exp [—(Ap — HealG0ye0) 
+ be Om EXP [—(Am — h)oolG Om oe) | exp [—(s — a0)(An — h)] ae 
= Si + & + Ss. 
We have 
|Si| = x Am exp [—(Am — h)s]G(Am 8) 
(87) < = | exp [~ Orn — doe] |- aloe) 


S (N = 1)-L-Ky 
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by use of lemma 1 and inequality (a). Also 


[$s] = | an exp [=n — Aoa)]GOnoe) 


G(r2 8) 


-|exp [—(s — a0)(An — h)] —-—"— Gor) 


S L-K4(oo) 
< L-K, 


by use of lemma 4 and inequality (c). 


t 











Now the sum 
n—1 


%= > | exp LK 2s i ae 


— exp [—(Amsi1 — A)(s — o)] Foe) | » a, exp [— (Ap — HalG0,e0) 


G(Am-+190) 


save for the factor exp [h(s — oo)] can be dealt with in the same way as was 
done in the proof of theorem 2, and similar to relation (3.4) we get 


p=N 


|'S2| S L-Kuv(oo) ee i exp [—(dw — A)&(s — on)]. 


Now suppose am (s — op) = 6. Then 


|s — oo| a ——__—_—_—— ha 
Re a) ~ CO = V1 + tan*o s Its 
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fora 2 00 + € Thus we have 


| S2| < L-Kw- 4/ 1+ E exp [—(aw — A)e] 
or 


(8.9) S: = O(|t exp [—(Qw — h)el)." 


Therefore, substituting (8.7), (8.8), and (8.9) in relation (8.6) we get for 1 < 
max [No , Ni] <N<n 


= |Si+ S: + Ss| 





yo exp [—(Am — h)s]G@(Am 8) 
< (NW — 1)-L-Ki + L-Kw4/1 +5 exp [-Qw — Wd + L-Ka 
oe 
(8.10) Y tn exp [—(n — h)s]@(m8) = O(N) + O(|t exp [—w — h)e]|). 
Case 2. For n S N we get at once 
>» Om EXP [—(Am — h)s]GAms)| S nLb-Ki S N-L-K, 
or 
(8.11) © an exp [=n — h)8IG0n8) = O(N). 


Now let N(|¢|) be so chosen that N(| ¢|) tends to infinity more slowly than 
||, and also so that relation (8.5) is satisfied. Then for either of the two 
cases for R(s) = o = op + e and for all n we have 


(8.12) 2 dm exp [—(m — h)8]Gns) = 0(| 1), 
thereby completing the proof. 
Lemma 10. If b > 0, k = O, then 


1 b+ ico ds a > 9 
ii lin oa" TET “=° 


= 0, o<d 
See Hardy and Riesz for proof.” 





* The notation F(z) = O(g(x)) means that | F(z) | < Ke(z) for values of z sufficiently 
hear a given limiting value. 
Hardy and Riesz, l.c., page 50. 
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TuroreM 20. (First Inversion Formula) If kn < w°< An4i and if the modified 
exponential series g(s) = >. Gm exp (—AmS8)G(Ams) 78 convergent for s = a + ity 
1 
where oo > &(d1) (E being given by lemma 3), then for c > oo we have 


1 f°" g(s) exp (ws) = = 6 3 3 G(Am 8) ds 
Qni cio | 6—§ GOus) s) al fa, x a c—tco op} On — “ Ga.) Gn 8) s) 8 


Proor. We consider f 4 function 


r(s) = “P oe 5 a0)| tee —- > Gm EXP (—rn8)40.00)} 





(8.13) = P 
Dam exp [—(Am — «)s] on exp (—wao) 


m=n+1 


where n is determined so that \, < < p re 




















We shall establish that 


[Mano 


—io § 


We consider the closed rectangle as shown at the side. The function r(s)/s has 
no singularities in this rectangle, and so by Cauchy’s theorem we have 


1s) fag mf Hay fH gg 4 [Oa 


iT, § c—~iT; § c+iT' $s ‘d—iT, s 


Now this finite rectangle can be enclosed in an angular sector about the point 
8 = oo + i, and by using the same argument as we did in theorem 2, given 








lified 
+ tt, 


aA 


nas 


en 
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«> 0, there exists an integer m such that m 2 n and for which for p 2 m ah 
we have Ba Sle 
eo G din ) 
Lam exp [—(Am — @)s] ee nas exp (—woo) 
eKio(o0) |8 — a0| a i § 
KA RoR a) AP IR — o0)rv] exp (—dae) exp (9(s)u) 

(8.15) eK |s mj oo | ey 

= Ks R(s — a0) Op br- Dig w)Rle — orl] 

< Ku, 


this estimate being independent of s as long as s is in the angular sector, and 
hence independent of the number d. Thus it follows that there exists a constant 


Ky such that 
— wg] Fn?) G(Am 8) 



































i Am exp [— (Am Gre) (—woo)| < Kx, 
for all s in the angular sector. 
Therefore 
d+iT2 r(s) Books ds d Ee iT» 
A?) < — tise ti tien 
Be 8 ds) S Ku air, 8 | Ku | log d — iT, 
(8.16) 1+ “ 
< 
= Ky log gi iT ; 
d 
and asd — o we have 
1+ ty 
(8.17) log a | oO 
La% 
d 
We shall now show that 
d—iT; d+Tig 
/ r(s) ds and r(s) ds 
c—iT) $ c+iT2 § 


are both convergent as d — ©, and thus we can write (8.14) as 


(8.18) / se fo i "Pitas r(s) 3, — | Te @ pe 


—iT) c—iT c+iT2 


If we let the equation 








(8.19) r(s) = SE a = p(s) 























802 


p(s) 
(8.20) [ 


Now r(s) is convergent at the point s = o). Therefore the series for the func- 


define the function p(s) we have 
r(s) exp [(Angi — w)8]G(An8) 


tion p(s) converges for s = o9. Identifying 
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bp = Antp ; 


Antp = Mp; 


we see that theorem 19 is applicable to the function 


p(s) = . by exp [—(up — ui)8]G(ups) exp (—wor). 


Therefore, given ¢, > 0, we can choose 7’ such that 7 > 0 and 


s=ot+1T, 
|p(s)|< aT. fory o 





Then 
dt+iTs 
porno 4 
c+iTe § 
(8.21) 
since BO < me I 
isl ® Vern 
(8.22) 
and likewise 
(8.23) 


3| = 


(7: 


ec 
> Ty 








+iT2 s 


IA 


€] T» 


| *87* p(s) exp [— (nia — w)s] 


G(An 8) 








+iT'2 
< €j T> 1 
~ K3(€) Ve+ T? 


€1 1 
Ks Anti Fa w? 


T2 





< 


and Vea ti <1. Thus we conclude that 


e+T; 


cr r(s) Qs 


+iT 2 8 


lim 


T2-0 





lim 


T 1-0 





I — r(s) ds 


—iT; 8 


| 273 exp [—(Ans1 — w)8] 
A 8G(Am 8) 






















> Am EXP [—(Am — Ang1)8]G(Ams) exp (—woo) 


m=n+1 


de| 


{- exp [— (ngs — w)s]ds 


Therefore from (8.19) and the above we obtain the evaluation 


[a= 


c—tco 


0, 








anc 


Usi 
inv 


is ¢ 
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wh 
ex 
by 


HANKEL AND OTHER EXTENSIONS OF DIRICHLET’S SERIES 803 


and recalling the definition of r(s) (equation (8.13)) we have 





+ 0(s) exp [w(s — o0)] 
Lk “8 G(An 8) 5 


fi | Am exp [— asl ores) | exp lol — ol. 


Using lemma 10 to evaluate the term in which a, is a factor we obtain the first 
inversion formula, 


1 f* g(s) exp (ws) gore Gms) ds 
mi cio | =§ “GOn8) oe > c—ico net On — ws 1G0.8) 3 s- 


TuroreM 21. (Second Inversion cians If the series 
g(s) = p> Gm EXP (—Ams)G (Ams) 


is convergent for 8 = oo + tto and tf Ax» <w < Anz, then for c => oo we have 


n ct+ioo 
g(s) exp (ws) = (008) Po > Am | . —. — 2)IG0ms) S 


1 c+ico 

ni c—too 

Proor. The proof follows along the same lines as did the proof of theorem 
20 except that we do not have to introduce the factor 1/G(A,<8). 

Remarks: For the Hankel series and for the special cases when » is an odd 
half-integer, the second inversion formula can be simplified. Let » — 3 = 
where k is an integer. It will be recalled that the Bessel functions have simple 
expressions for such values, and likewise the function M,(\,.s) can be expressed 
by the finite series 


M,(\m8) = 3 exp (—wu*(1 + zt) a 


- [oo sve + (Hats +(@) (ots) +--+ (ae) 
0 2m 8 2m 8 2m 8 
k\ (k + 1)! (k + 2)! (2k)! 
= ! ey i eee ES 
esis (‘) Dae (3) mse tT Oknsy? 
and on substituting this in theorem 21 and using lemma 10 we get 


™“* h(s) exp (ws) 7. _ + k + 1)! (w — Xm) 
=| = Yo an [at + (FOR wine 


k\ (k + 2)! (w — Xm)” (2k)! (w — Xm)" 
+(5) @j 2 * 't@p ft |. 


We recall that for y = 4 the Hankel series reduces to the Dirichlet series, and 


Ke note that the above formula reduces to Perron’s formula (8.1) for y — } = 
= 0. 
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9. Conclusion 


By defining (9.7 
eas 0Os2<m, as ¢ 

AQ) = + 

n—1 seri 

(9.1) A(ha) = : am + 2 pe 
A(z) = Lam, he i< @ < Meike sing 

we can express the Dirichlet series f(s) = Da exp (—A,8) as the Stieltjes in 
integral f(s) = [ ‘ exp (—2s) d(A(x)), and Perron’s formula for the sum of the 
coefficients becomes 7 
(9.2) AG) = st [2 exp (esas aug 


Likewise the modified exponential series g(s) = >> dn exp (—An8)G@(Ans) becomes 
1 

the Stieltjes integral 

(9.3) g(s) = | exp (—28)G(a8) d(A(z)). 


It should be possible to obtain a convergence theorem similar to theorem 2 from 
a consideration of the Laplace-Stieltjes transform.” 
An integration by parts of equation (9.3) would give formally 


(9.4) g(s) = { s exp (—2s) {G¢es) + 209) 4(2) dz, 


which can be regarded as an integral equation with the kernel 


(9.5) s exp (—2s) (ote) a ote | 


and the two inversion formulae for the modified exponential series could be 
written in the form 


(9.6) [ * dA(z) | [ me exp [—(x — w)s}G(x8) =| ‘ ] si 18) CP O° as, 


and 





18 Private communication from Prof. J. D. Tamarkin to the author. 
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rs) c+ico = + G(zs) ds = ct+io g(s) exp ws 
(9.7) [ dA(z) i exp [—(z — )s] GOus) a] = ly ~sGOns) ds, 


as can be formally verified by substituting for g(s), and changing the order of 


integration. 
Hadamard’s results relative to the composition of singularities of Taylor’s 
series show that the only possible singular points for a function represented by 





the series > nb, exp (—ns) are points of the form a + 8 where a is some 
singular point of the function f(s) = > a, exp (—ns) and @ is some singular 


point of g(s) = > b, exp (—ns). Mandelbrojt obtained analogous results 


applicable to the general Dirichlet series, but the present author has been unable 
to get an extension of Mandelbrojt’s work applicable to the modified expo- 
nential series. 

In conclusion I wish to thank Prof. S. Bochner of Princeton University who 
suggested this topic and whose advice and criticism have always been helpful. 
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ON THE INTEGRALS OF CANONICAL SYSTEMS 


By Cart Lupwia SIEGEL 


(Received December 19, 1940) 


1. Trigonometrical series 


We consider a canonical system of differential equations 


(1) ty = Ay, y i= —H;z, (k = 1, --- , n) 


and suppose that the real function H does not contain the independent variable 
t and is, in a neighbourhood of the origin, an analytic function of the 2n variables 
%1,°++,Yn- Let the number » of degrees of freedom be at least 2. We sup- 
pose moreover that the origin is an equilibrium point of the system; i.e. all the 
2n derivatives H,, , Hy, (k = 1, --- , n) vanish at the origin. It may also be 
assumed that the function H itself vanishes at the origin. Denoting the 2n 
variables 21, --- , Yn by 21, +++, Zan, we write the system (1) in the form 


2n 
(2) a = 2d Qnuzi + Ri (kK = 1, +--+, 2n), 


- 


where R; is a power series in z,, --- , Ze, beginning with terms of the second 
order. 

Let \1, -++ , Aen be the characteristic roots of the matrix (a). In our case 
of a canonical system, the characteristic polynomial is an even function; hence 
we may arrange the roots such that 


(3) Neen = —Ne (k = 1,-++,n). 


We suppose that 1, --- , An are linearly independent, with respect to the field 
of rational numbers; this means that the relationship 


gid + sie + grrdn = 0 


holds in integral numbers g; , --- , gn only for g; = 0, --- , gn = 0. 
If w is a power series of 2n variables 7%, m (k = 1, --- , n) without linear 
terms and the determinant | w,,,, | ~ 0 at the origin, the equations 


(4) & = Wy, , Yr = Wy, (k = 1,-++,%) 


define a contact transformation of the variables x, --- , yn into the variables 
£1, +++, mm, and the canonical system (1) is invariant: 


(5) & = H,,, m= —Hz, (k = 1,-+++,) 
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It has been known’ for a long time that after an appropriate contact transforma- 
tion the function H will depend only upon the n products 


(6) em = Se (k = 1,.--,n) 
and take the form 
(7) H= > esi + R, 


where R is a power series in the n variables {, , --- , ¢, beginning with quadratic 
terms. The integration of (5) for this normal form of H is then immediate. 


Since 


(8) Hy, = &H;, , Hy, = mH;, , 
we deduce from (5) that the n products ¢; are constant; hence 
& = ayer’, om = Bye “Ee (k =1,---,n), 
with arbitrary constants a,,6;, and 
OBE = Sk (kK = 1, -+-,m). 


The original unknown functions 2; , ---+ , yn become now series with the general 
term 
Cos gieentats: tH8 Ft, 

where the coefficient c,,...,, denotes a constant and g;, ---, gn integers. If all 
the characteristic roots \, , --- , An are pure imaginary numbers and the initial 
values of 4;, +--+ , yn real, then all the values H;, , --- , H;, are pure imaginary, 
and we get a representation of the solutions of the canonical system by trigo- 
nometrical series. 

This elegant method of solution has also been generalized to the case of a 
function H which contains explicitly the variable ¢, in periodical form, and is 
closely related to the important researches of Delaunay, Hill and Poincaré’ in 
celestial mechanics. However, there is a serious objection: The question of 
convergence has never been settled. If we define sum, difference, product, quotient 
and derivative of power series in a formal algebraic manner, we can perform 
these operations also with divergent power series, and then we can construct 
by straightforward calculation a transformation of the type (4) which reduces 
H to a power series of the n products &, alone. But no proof for the conver- 
gence of this contact transformation has been given, with exception of some 
special examples, when the integration of the system (1) can also be carried out 
by elementary methods. On account of the small divisors appearing in the 





1K. T. Whittaker, On the solution of dynamical problems in terms of trigonometric series, 
Proceedings of the London Mathematical Society, vol. 34 (1902), pp. 206-221. Cf. also 
G. D. Birkhoff, Dynamical systems, New York (1927), chap. 3, and E. T. Whittaker, A 
rg on the analytical dynamics of particles and rigid bodies, 4th edition, Cambridge (1937), 
chap. 16. ‘ 


* Cf. H. Poincaré, Les méthodes nouvelles de la mécanique céleste, Paris (1893), vol. 2. 
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coefficients of the transformation, it seemed to be probable’ that the series will 
diverge in general, but no single example had hitherto been found. From Poin- 
caré’s well-known theorem‘ on the analytic integrals of canonical differential 
equations we can only infer that those series do not uniformly converge, if 
hi, ++, An are variable complex parameters, whereas this theorem cannot be 
applied to a fixed function H. 

We arrange the coefficients of H in a certain order and denote them by 
hi, he, hg, +--+. We assume that the power series H converges in a neighbour- 
hood of the origin and that the characteristic roots \; , --+ , A, are pure imaginary 
linearly independent numbers. The corresponding systems (hi, he, hs, ---) 
form the points of a space 2. A point of = is called singular, if the transforma- 
tion of H into the normal form (7) cannot be performed by a convergent contact 
transformation (4), and else regular. 

THEOREM 1. Let (c;, C2, ¢3,---) be a point of 2 and &, @, 6&,-++ an 
arbitrary sequence of positive numbers. Then a singular point (hi, he, hs, --+ ) 
of = exists in the domain 


Cc—-—a<h<catea (k = 1, 2,3,---), 


This theorem asserts that the singular points are everywhere dense in 2. We 
shall reduce the proof to that of another theorem concerning the integrals of a 
canonical system. It would be important to obtain also some information about 
the distribution of the regular points of 2, but this seems to be rather a difficult 
problem. We do not know e.g., if the regular points are also everywhere dense 
in 2 and if they constitute an open connected set of points. In particular, it 
would be interesting to decide, whether H is regular or singular in the special 
case of the restricted problem of three bodies, with respect to the equilibrium 
solutions of Lagrange. But this seems to be beyond the power of the known 
methods of analysis. 


2. Integrals 


If P is any convergent or divergent power series of the 2n variables 2 , ++ , Yn; 
we define the Poisson bracket (P, H) by the power series 


(P, H) i 2d (P,,Hy, i P,, Hz). 





*G. D. Birkhoff, Surface transformations and their dynamical applications, Acta Mathe- 
matica, vol. 43 (1922), pp. 1-119. Cf. on the other hand G. W. Hill, Remarks on the progress 
of celestial mechanics since the middle of the century, Bulletin of the American Mathematical 
Society, 2nd series, vol. 2 (1896), pp. 125-136, and E. T. Whittaker, On the adelphic integral 
of the differential equations of dynamics, Proceedings of the Royal Society of Edinburgh, 
vol. 37 (1918), pp. 95-116. 

*H. Poincaré, Les méthodes nouvelles de la mécanique céleste, Paris (1892), vol. 1, chap. 5. 
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ON INTEGRALS OF CANONICAL SYSTEMS 


We call P an integral of the canonical system (1), if the equation 


(P,H) =0 
holds identically in the variables; in other words, P is an integral, if the rela- 
tionship 
P=0 


follows from (1). 
The expression (P, H) is invariant under any contact transformation. By 
(6) and (8), we obtain . / 


(¢%,H) = 0 (k = 1,---,m). 
Introducing into {% = &m, the original variables 2, , --- , yn, , we find n power 
series 
(9) fe = Pi(ti, +++ , Yn) (kK = 1,---+,n) 


which are integrals of (1). Since the functional determinant of & , --- , 7, as 
functions of 21, +++ , Yn does not vanish identically, these » power series are 
certainly independent one from another, i.e. there exists no power series of the 
variables P; with constant coefficients not all zero, which vanishes identically 
in the variables 71, +--+ , Yn. 

Obviously the function H itself and more generally any convergent power 
series in the single variable H is a convergent integral. If H is regular, in the 
sense of our former definition, there will exist a convergent contact transforma- 
tion reducing H to the normal form (7), hence the integrals (9) will then also 
converge. Moreover, by (4), we deduce easily that the integral P, cannot be 
expressed as a power series in H alone. Therefore Theorem 1 is contained in 

THEOREM 2. Let (c;, C2,¢3,--- ) bea point of 2 and |, &, &,--- an arbi- 
trary sequence of positive numbers. There exists a point (hi, he, hs, --+) in the 
domain 

Ce — & <he< te t+ & (K = 1, 2,3, ---) 


such that any convergent integral of the corresponding canonical system (1) is a 
power series in the single variable H. 

The proof of this theorem depends upon several lemmata. 

Lemma 1. Any integral of (1) can be represented as a power series in the n 
integrals Py, +++, Py. 

Proor: It is obvious that the sum, the difference, the product of two integrals 
and more generally any power series in a finite number of integrals without 
constant terms is again an integral. Let P(x, --+ , yn) be any integral of (1). 
By the contact transformation (4), this integral becomes a power series in the 
variables |, -++ , 9, with the general term 


Ca,---6, 1 (Ef* nk"), 


k=1 
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where ¢q,...8, denotes a constant. Consider now the m differences 
an — Br = ge (k = 1,-..,n), 


The sum of all special terms with g, = 0, --- , gn = 0 is a power series T in the 
n products &m = % = Pilati, +++, Yn), and the expression 


P-TzJ 


is again an integral. If J is not identically zero, take a term j of least degree 
(a, + Bi) +--+ + (an + Bn) = d and calculate the terms of degree d in the 
power series (J, H). Since 


(J, H ) = 0, 
we obtain from (6) and (7) the relationship 


DY MlGiits — min) = 0 
or 
:¥ Ange = O. 
k=1 


This is impossible, the numbers \; , --- , A» being linearly independent and the 
integers g; not all zero. Hence J = 0 and P = T a power series in the integrals 
P 1 pees Pry P. ne 

3. Linear transformation 


For our further purposes it is practical to introduce new variables uw, «++ , Un 
by a special linear contact transformation, which reduces the quadratic terms 
of H(x1, +--+, Yn) to the normal form Ayu, + «++ + Aguada. Obviously we 
find such a transformation, if we replace in (4) the power series w by the sum 
of its quadratic terms. Let 


(10) a = > (Cert + Cx,24n02) (k= 1,---, 2n) 


be this transformation, 
(11) ty = (x2) 


its matrix and 
(12) H = E(w, +--+, 0%) = D> AcUere + +e 
k=1 


the power series for H as a function of the new variables. By (3) and (12), the 
transformed canonical system is 


(13) ti = Ey, = Mure t+---, te = —Eu, = Nene +++ (k = 1,-++,m) 
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Let % be the matrix of the coefficients a, in (2) and D the diagonal matrix 
with the diagonal elements 41, +++, en. By (2), (10), (11), (13), we deduce 


AC = CD 
or more explicitly 
Me, = err (k = 1, +++ , 2n), 


where ¢, denotes the kt column of the matrix ©. Since A,, --- , A» are linearly 
independent, the 2n characteristic roots i, --- , Aen are certainly different one 
from another. Therefore the general solution of the linear equation 


Ar = m 


r= kp 


with an arbitrary scalar factor p. On the other hand, d, is pure imaginary, 
and hence by (8), the bar denoting the passage to conjugate complex numbers, 


We, = CrArin (kK = 1,-+-,n). 
This proves the relationship 
(14) Ck = Ch+nPre (k = 1, eee , 2); 


with a certain scalar factor p,. Hence the linear functions z of the variables 
uy, +++, 0, are not changed, if c; is replaced by ¢, (J = 1, --- , 2m) and at the 
same time the variables uw, , v, by @,, 6: , where 


(15) a; = pi v1 ’ Od, = pm (J = 1,---,m); 


For any power series F, we denote by F the power series with the conjugate 
complex coefficients and the same variables. Since H is a real function, 


A(x, <* » Yn) - A(a, +++, Yn), 


we obtain 
E(u, +++, Un) = E(t, +++, On) 
and in particular, by (12) and (15), 
he = Vadepe (k = 1,---,n). 


The characteristic roots \, --- , An are pure imaginary; hence the same holds 
for the n numbers p,,'-++ , pn. Obviously the canonical form of the system 
(13) and the quadratic terms of E are unchanged, if the variables u,, v% are 
replaced by ay uz, axv, (k = 1,---,m), where ai, --- , @, are arbitrary con- 
stants ~ 0. Denoting the columns c,a;, Ceind% again by cz, Chin, We have to 
replace the factor p, by px(a,4,) . Therefore we may assume that py = +7. 
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If p, = +7 for any value of k, we replace moreover wz , vz by v% , — uz and obtain 
the case px = —7. Hence it is allowed to assume 


pr = 1 (k& = 1,--.,n), 


Now we change the notation. We suppose that the function H = 
H* (2, , --+ , yn) and the matrix € are given and denote the characteristic roots 
Mi, °°* An by Ai, -++,Aa, the corresponding power series E by E*. Let Pp 
be the set of all convergent power series 


(16) E(u, Sesh Un) = > AgURVE + ++ 


satisfying the condition 
E(u, -++, Un) = Elin, «++, tun), 


with linearly independent values of \;, --- , An. Then the inverse of the linear 
transformation (10) takes any such function E again into a real power series H 
of the variables 2, = 2%, Yr = Zkin (kK = 1,---, 7). 

If F is any power series in the variables u,---,v,, we denote by F; 
(l = 0, 1, 2, ---) the sum of its terms of order | and by [F | the maximum of the 
absolute values of the coefficients of these terms. The proof of Theorem 2 is 
now tantamount to the proof of 

THEOREM 3. Let @, 6, &,---+ be an arbitrary sequence of positive numbers. 
There exists a power series E of P such that 


[E, — Ef] <a (l= 2,3, ..-) 
and any convergent integral of the canonical system 
Ux = Ey, , th = —Ey, (kK = 1, +++ ,n) 


is a power series in the single variable E. 
Let E be an arbitrary power series of the set P and H(2, --- , yn) the corre- 
sponding real function. By the linear transformation (10), the n power series 


(17) Se = P(t, +++ , Yn) (k = 1,-+-,m) 
become integrals of the system 
(18) % = E,,, ih = —Ey, (k = 1,---+,7). 


Let Q be one of these integrals and 
Q: = aX (arm: Ux Ur + Barter + yarn) 


the sum of its terms of second order, where 


kt = Ak, YR = Vk 
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Calculating the terms of second order on the left-hand side of the equation 
(Q, E) = 0, 
we find by (16) the expression 


DS fan(An + Arua + Ba(An — ArluRvr — yar(Ae + Ar)verr}. 


k,l=1 
Hence 
or = 0, Ye = 0 (k,l = 1,---,n), 
Bir = 0 (k #1;k,l=1,---,n), 
and ¢;, takes the special form 
(19) t= De baum + + (k =1,---,n). 


By (4) and (10), the variables &, --- , m can be expressed as power series in 
u,+++,Un. The determinant of the linear terms in these power series is 
different from zero. Let yi, --+ , ¥, be indeterminates and 


(20) x = 2, bute (1 = 1, ---,n). 
k=1 
By (6), (19), (20), the quadratic form 
DL vetem 
k=l 


of the 2n variables &, --- , 72 has the same rank as the quadratic form 
n 


> XiUY 
=1 


of the 2n variables uw, ---,0,. Hence the n linear forms (20) vanish simulta- 
neously if and only if y, = 0 (kK = 1,---,m), and consequently their deter- 
minant is different from zero. On solving for uw; from (19) we obtain n in- 
tegrals of (18) in the form 


S? =uat+.--- (i = 1,---,n). 


Let us suppose that S“ contains a term of the special type 
(21) c Il (uno), a++--»-+a,=g>1 
k=1 
and that the degree g of this term is as small as possible. Then the integral 


se ‘ial IIls”™ 


k=1 
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does not contain a term of this special type of degree < g. It is now obvious 
that we can also construct n integrals 


a” = uw +... (1 = 1,---,n), 


which do not contain any term of the form (21). Moreover the integrals 
S*...., 8 and consequently the integrals (17) are power series in 8”, 

..s'”, Lemma 1 gives the result, that any integral of (18) is a power serie 
of the variables s, -.- , s“. This holds in particular for the integral E. By 


(16), the power series expressing H as a function of s*’, --- , s\” has the form 


E = dons” + «+; 
k=l 


as a power series in E, s™, --- , 8". There- 


hence we may also represent s 
fore we have 
LemMa 2. There exist n — 1 integrals 
‘ 8” =umt+.--- (] = 1,---,n-1) 


containing no term of the type (21) and such that any integral of (18) is a power 
series in E, 8, .-. , 8”. 

For the rest of our investigation, we will only consider the case of two degrees 
of freedom. As a matter of fact, the generalization of our proof to the case n > 2 


requires more complicated calculations, but does not present more serious 
difficulties. 


4. Estimation of the coefficients 


Lemma 3. Let & » be complex numbers and G a homogeneous polynomial of 
degree r in the variables u; , U2, 1, V2. Then 


(22) (\E| + |n|)[@] S (2r + 2) | (Eurvr + quar)G |. 


Proor: We can write 





_*< (1) 
G Xe, 


where G" is also homogeneous in the variables w, we, of degree l. Since 
[G@| is the maximum of the values [G®] (lL = 0,--+ , r) and |(Euswit+nuar2)G| the 
maximum of the values| (gain + nuev2)G© |, theinequality (22) will certainly hold, 
if it is true for G? instead of G and 1 = 0,---,7. Hence we may suppose 








l 

6 I- 

G= h® PrU1 Us Pm 
k=0 


_ where g denotes a homogeneous polynomial in v,, v2, of degree r — I. If 
|&| > | |, we interchange é, u; ,¥, and 7, Uw, v2. Consequently, we have only 
to consider the case 


(23) [é| S| al. 
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Putting 
gu = 0, gin = 0, 
DB, = Evige-1 + moyr (k = 0,---,1+ 1), 


we obtain 
i+1 


(Eun + nuev2)G = p> },ui uy 


k 
(nv2)*" gy, = 2», (—é)"?(qv2)? &, 
whence, by (23), 
k 
Inilel S p> [#,|. 


Since [G| is now the maximum of the numbers |g | (k = 0,---,2) 
((éun + nuav)G | the maximum of the numbers [| (k = 0,---,2 + 
the inequality 





and 
1) 


’ 





In| [@| S @+ 1) [Eur + nuer)G| 


holds. Moreover 


[él + la] 2/1, 


and the lemma is proved. 

By Lemma 2, the existence of n — 1 integrals s° (J = 1, ---,m — 1) with 
certain properties was stated. In our case n = 2, let us denote the integral 
s” more shortly by s. Then 


t+1sr+1, 


(24) s= > s, 8 = WN, 
k=? 


where s; is a homogeneous polynomial in uw, ue, v1, %, of degreek. If k > 2, 

the polynomial s, does not contain a term of the special form c(u:0:)"(uav2)’. 
Lemma 4. Let the canonical system 

(25) m=, t= —Ey (k = 1, 2) 


possess a convergent integral, which is not a power series in E alone. Then the 
sequence 





log [s,| ; ES 
i log k (k = 2,3, ---) 


has a finite upper bound. 
Proor: By Lemma 2, any integral P(u; , ws, v1 , ¥2) of (25) can be written as 
& power series in E and s, 


P= 9 Caps" EE”. 
a,p=0 
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We assume that there exists at least one coefficient 
Cap = 0, a> 0; 


take a + 8 = g as small as possible. If P is a convergent integral, the same 
holds for the expression * 
g—1 


p(ur, U2, 1, V2) = P or. Dd cog E*. 
B=0 


By (16) and (24), we find the decomposition 


ote Pk ; 


where 


Py = 2 e098 Bt, & = WN, E, = Amr + A2UeV2. 


The polynomial 


(26) A= Ory _ bs Cag 8s Ee 
O82 at+p=g 


does not vanish identically; since it is homogeneous in the two variables uw, 
and Uev2 , it may be written in the form 


g—1 
(27) A=c II (Emr + n° u2v2) 
h=1 


with constants é”, 7” not both zero (h = 1, --- ,g — 1) and a constant c # 0. 
We denote by z, y, z any three of the variables u; , ve , 01, ¥2. Since pis a power 
series in s and E, the functional determinant 


dE, p, 8) _ 
d(x, y, 2) 


identically in z, y, z. Calculating the terms of degree h — 3, we obtain 


0, 


d(Ea , Pay 8) 
ee ee ie h = 2g + 4). 
atbty=h (x, y, 2) : = 


We apply this relationship for z, y, z = wu, Ww, v1 and forz, y, z = UW, UW, rr. 
Denoting the corresponding expressions 


d(Ea, ps) d(E., pp) d(E., pp) 
ent Fr a Na) FB ee =2 2 
atf—t d(y,z) > atp—t U(z,z) ’ atpmt d(x, y) dada 
by Aun, Aa, Asx and Bu, Bu, Bai, we find 
Os Os 0s 
28 ae nites —_|= 
(28) a Uo, + Az Dus + Ax =) 0, 


0s. 0s 0s 
29 a. O8y O8y\ _ 
(29) zt ” au, + Pay, + Ba a7) - 


wom aalinenatipaesit badges natn eT 


hold 


The 


If ¥ 


whe 


(30) 


(31 
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Let u1, #2, us, Ma be certain appropriate positive constants, which do not 
depend upon the subscript k appearing in the formulas. Since the power series 
E and p converge, the inequalities 


|Z, | < Mi (k = 2), 
[ pe | < ui ; (k = 29) 
hold, and consequently 
ae} & | ODE | k 
Ox < hur, ay < ky. 


re 
3 


If ¥, denotes any one of the six polynomials Ay, --- , Bs, , we have 


The polynomial _ is a sum of (' " terms, hence 


[ve | < ms (k = 2g + 2), 
whence 
(30) jy s1(7t 7) ui Sy}. 





By (26), the identity 
d(E2, Pr) _ , AEs, 82) 


d(z,y) (zx, y) 
holds and therefore we obtain in the case 1 = 2g + 2 for Au, --- , Bs; the 
values AoA, 0, —Agdi2A, 0, AditeA, —odi2A. From (28), (29), (30), we 
deduce now the inequality 


OS: Os; | 
AeveA comene sms pie 
2V2 (u oun, V1 ) 





al 


+ 








| Os; 9) 
Mon A oe oe oe 
2U1 (u 5 V2 ave 








k-1 


< 2d (y + 3)* [s)]u2**?7 
S 


and, by (27) and Lemma 3, 








k—1 
+ | 2 — 0 28 <kh [elms (k= 8). 


y=2 


| @ f) 
wane, 
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w = au?! vf us? vf? 
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is any term of s; , then TI 
lo: — Bil + lo: — &| 21, supp 

Ow Ow Ow Ow _ 
” du, mie av; spi (a1 Bio, U2 dus V2 dee = (ae Baw, W 





and (31) implies 
k—1 
(32) [|< he X [s,| us” (k = 3,4,...), are 
Y 
Obviously the inequality 
(33) | s1| S (20°? ys) 
is a 


is true for] = 2. If it is proved for] = 2, 3,--- ,k& — 1, we find from (32) 
the relationship (34) 





i k-1 k=l (35) 
[in| < bP 5) (Qy* ya) uh S (hE wa)? DO? < (2K yy), 
a ™7 and 
and (33) holds for! = k. Hence lowi 
log | 8 | 
klogk ~ “: (37) 
5. Proof of Theorem 3 (38) 
On account of Lemma 4, it is sufficient for the proof of Theorem 3 to con- f Ym 
struct a power series . (39) 
E=DE, } Iti 
a | are 
with the following 4 properties: . 
I) Ex(us , Us, V1, V2) = Ex(ivy , ive, tu, ime) (kh = 2,3, +4+); | the 
) 

II) Ey = dy; + AgUade, 
where v1 , dz are linearly independent; | hol 
IID) _ [R—El<e (k = 2,3, ---); | (40 
IV) the sequence Sto 

log [sz (é = 2,3, ---) } (38 
k log k , | 


has no finite upper bound. 
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The positive numbers @, ¢;,--- are arbitrarily given. Obviously we may 


suppose that 
& <1 (k = 2,3,...). 


We begin with the construction of \;, \2. The coefficients Ay , Az in 
Ex = dru, + Ar dar» 


are pure imaginary and linearly independent; hence 


w* = a 
de 
isa real number. We choose two integers qg, r such that 
(34) q>1+|o*|+2|y la, 
(35) lqo* + r| <1 


and define three sequences of numbers gm, 1m; lm (m = 1, 2, +--+ ) in the fol- 
lowing manner: 


(36) Tm = Gu (: _ > a); 


(37) ln = gm + | Tm |; 

(38) n=; 

(m+. is the least integral power of q satisfying the inequality 
(39) qm > Gm + 4| Ae |qain’. 


It is obvious that the numbers qm, 7m, lm are uniquely determined and qm, ln 
are positive. For the exponent a» in 


qm = g”™ 
the inequality 
Omi > 2am 
holds. Since a; = 2, we obtain 
(40) Gn 2", Qm41 — Om > Gm = 2”, 


hence r», and J» are integers and the sequence ¢ngni1 tends to zero. By (34), 
(38), (40), the series 


(41) = > qe 
k=1 
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converges, and we find the estimations 


(42) 0< Lai <a Dag <2 (h = 1,2, ...) 


koh 
(43) 0<0<a' Sq. 
Moreover, by (35), (36), (41), 
| rm | < qn(jo*| + 9" + 4), 
whence, by (34), (37), (88), (39), (43), 
(44) mgs — Im 2 Gms — Gm — |tm| > mq’ — 1 — |w*| — 2g") > 0. 
Let Ax, Ae be defined by 


(45) m=at(o-2), he = Ae; 


then A; , Az are pure imaginary and 
Ey = Air, + Agtad2 
satisfies I) with k = 2. For the expression 
(46) Pm = Mdm + Alm (m = 1,2, +++) 
we find, by (36), (41), (45), 


2) 


pm = ede Dy Ge 
k=m+1 


and by (42) 
(47) O < pmdz < 2qmQnis « 
If x, y are two indeterminates, the identity 
(Mit + Day) gmd2" = (Gmy — Tait) + pmdz't 


holds. For given integral values of z and y, the number qmy — 1m is integral, 
whereas the absolute value of pmAz'z, by (47), is less than 1, if m is sufficiently 
large, and 0 only in the case x = 0. Therefore the numbers A: , A: are linearly 
independent. On the other hand, by (34), (35), (43), (45), 


[> — Ar | = [az | 





Tr aid dal 
@-7 <|y|@? +") <e. 





Consequently IT) is satisfied, and III) for k = 2. 
By (84), (87), (88), we have 1, = 4. If we define 


E.= Ey (2<k <li), 


the | 


such 
of P 


of tl 


whe 


(48) 


L 
SaByé 
a = 


(49) 


whe 


(49) 
min 


(50) 


witl 


and 


om | 


(51 


(52 
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the conditions I) and III) are satisfied fork < i. Let m be any positive 
integer and assume, that the polynomials EF; are already determined for k < lp 
such that I) and III) hold. Consider now any power series E = EF, + E; + --- 
of P with these fixed terms Ey (k = 2,3,---,1m — 1). Let s be the integral 


eo 
s= De, & = un 
k==2 


of the corresponding canonical system. Then 


(s, Z) = 0, 
whence 
l 
Di (sn, Fizz») = 0 @ = 4 «.;) 
h=2 
as, 2a 88) 
“ di(u 28 "1 as) + na (ue 2 M2 
~ OE, OF, fo if 
= U Ou Vy on > (s, Euy2-n). 


Let Sasysurugoy v, Eapysusugny v, be corresponding terms of s; , £,, where 
Sass » Eagys are the coefficients. Then a + 6+7+ 6 = 1 and sqs,3 = 0 for 
a= y,8 = 6. From (48) we obtain 


(49) {(a “ y)M my (8 sa 5)A2} Sages - (a ne y)Eapys + bapys ; 


where b.g,s denotes a certain bilinear function of the coefficients of s;, --+ , 8:4 
and E;,---,E,1. Since &; is fixed for k = 3, 4, +++ ,ln — 1, we infer from 
(49), that the coefficients of s, for k = 3, 4,---,lm — 1 are uniquely deter- 
mined and that the same holds for the expression 


Sapys — th 
ap (a — ym + B— BAe 
witha + 8B +y+6= 1, anda — y, 8 — 6 not both zero. Take in particular 





(50) Eapys 


Pee ae er eee ae 
a= dn, B = 0, y¥ = 0, 6=-ra, if tn <0 


and denote the corresponding coefficients of s:,,, E,,, Et, more shortly by 
on er Then, by (46) and (50), the value 


(51) om — YmPn (Im — tm) = Ym 


is uniquely given. 
If we choose for 7m the two values 


(52) tm = tm + F€1y ; 
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we find, by (51), two values of om which have the difference gmpn'e:,,. By 
(39), (45), (47), the inequality 


| @mbm Ey | > $Qm4t | Ao | “ery > 2lm'™ 
holds. Hence we can determine the sign in (52) such that 
(53) | om | > Irie, 


By (44), we have lm < Imi. Fork = lm, ln + 1, +++ , ling: — 1 we define 
E, by 


Ei, = Ei, + dein (uius™ + i ofrvi"), if rn = 0, 
E,,, = Ej, + }e1,(ufror"™ + i'™vf"uz™™), ifn <0, 
E, = Ey la <b < lay). 


Now the conditions I) and III) are also satisfied for 1, S k < Imi. 
By this construction, a power series EF satisfying I), II), III) is uniquely 
determined. The corresponding integral 


$= p 8k 
k=2 
contains in the term s;,, the coefficient ¢,. By (53), the inequality 


[s,.| >In” (m = 1, 2, ---) 
holds. Hence the condition IV) is also satisfied, and the theorem is proved. 


INSTITUTE FOR ADVANCED STupy. 
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This extra number has 


This will be number 5 of volume 42. 
been made possible by the generosity of the American Philosophical Society. 


The Index for the entire volume 42 will be incorporated in the extra number. 
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few months. 


